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FOREWORD 


At  the  Eleventh  Conference  on  the  Design  of  Experiment?  Dr?, 

Joseph  Cameron  and  Walter  Foster  discussed  the  possibility  of  holding 
the  next  meeting  at  the  National  Bureau  of  Standards.  Talks  with  Dr. 

Badrig  Kurkjian  brought  out  the  fact  that  he  would  be  willing  to  investi¬ 
gate  the  possibility  of  the  Harry  Diamond  .Laboratories  serving  in  the 
role  of  the  second  host.  The  efforts  of  these  three  individuals  brought 
about  the  desired  results.  The  Army  Mathematics  Steering  Committee, 
the  sponsor  of  these  meetings  on  behalf  of  the  Office  of  Chief  of  Research 
and  Development,  was  pleased  to  hear  from  Dr.  Allen  V.  Astin,  Director 
of  the  National  Bureau  ot  Standards,  and  Lt.  Colonel  M.  S.  Hochmuth, 
Commanding  Officer  of  Harry  Diamond  Laboratories,  that  their  organiza¬ 
tions  would  serve  as  joint  hosts  for  the  Twelfth  Conference.  Both  Messrs. 
Astin  and  Hochmuth  graciously  agreed  to  give  welcoming  addresses  at 
the  start  of  the  conference.  Their  talks  set  the  stage  for  this  interesting 
scientific  meeting.  Incidentally,  the  Harry  Diamond  Laboratories  and 
the  National  Bureau  of  Standards  served  as  joint  hosts  for  the  first  three 
conferences  of  this  series.  At  those  meetings,  as  well  as  this  one, 

Mr.  John  Wheeler,  Chairman  on  Local  Arrangements,  is  well  remembered 
by  those  in  attendance  for  his  excellent  execution  of  the  many  details  which 
must  be  handled  for  smooth  running  symposia. 

The  conference  was  conducted  at  the  new  quarters  of  the  National 
Bureau  of  Standards  at  Gaithersburg,  Maryland.  This  afforded  the 
attendees  an  opportunity  to  become  acquainted  with  these  new  facilities, 
and  some  of  the  many  scientific  experiments  being  conducted  by  the  staff 
of  the  Bureau.  They  also  learned  of  some  of  the  types  of  data  which  NBS 
could  furnish  that  would  be  helpful  in  the  conduction  of  their  own  research. 
For  the  benefit  of  those  who  did  not  get  to  this  meeting,  we  mention  here 
some  of  the  special  equipment  now  on  the  Gaithersburg  campus.  There 
are  three  35-foot  grating  spectrographs.  One  operates  in  the  vacuum 
ultraviolet  region,  another  in  the  visible  region,  while  the  third  is  used 
for  the  short  wave  ultraviolet  region.  The  NBS  LINAC  is  a  100  Mec  linear 
electron  accelerator  capable  of  producing  one  of  the.  world's  most  intense 
high-energy  electron  beams.  Neutron  irradiation  experiments  can  be 
conducted  with  the  new  10-megawatt  nuclear  research  facility.  The  world's 
largest  testing  machine,  a  12-million  pound  compression  and  tension  tester, 
is  about  ready  for  use.  This  monster  rises  almost  100  feet  above  its 
base.  These  and  many  other  new  scientific  machines  are  to  be  found  at 
these  well-equipped  laboratories. 

The  program  of  the  Twelfth  Conference  featured  the  following  four 
invited  addresses; 


Operations  Research 

Professor  Brian  W.  Conolly,  Saclant  ASW  Centre 

Statistics  as  a  Diagnostic  Tool  in  Data  Analysis 
Dr.  John  Mandel,  National  Bureau  of  Standards 

Planning  and  Analysis  of  Observational  Studies 
Professor  W.  G.  Cochran,  Harvard  University 

Sample  Censoring 

Professor  Norman  L.  Johnson,  University  of  North  Carolina 
at  Chapel  Hill 

Besides  these  talks,  the  members  of  the  audience  were  able  to  select 
from  24  contributed  scientific  papers  topics  that  best  suited  their  own 
needs.  These  papers  were  presented  in  eight  technical  and  two  clinical 
sessions.  We  are  pleased  to  say  that  Dr.  Frederick  F.  Stephan,  Presi- 
ent  of  the  American  Statistical  Association.,;  was  able  to  attend  the  banquet. 
He  was  called  on  to  present  the  second  Wilks  Memorial  Medal  to  General 
Leslie  E.  Simon. 

This  volume  of  the  Proceedings  contains  24  of  the  papers  which  were 
presented  at  this  meeting.  The  Army  Mathematics  Steering  Committee 
has  asked  that  these  articles  on  modern  principles  on  the  design  of 
experiments,  together  with  the  application  of  these  ideas,  be  made  avail¬ 
able  in  the  form  of  this  technical  manual.  Members  of  this  committee 
take  this  opportunity  to  express  their  thanks  to  the  many  speakers  and 
other  research  workers  who  participated  in  the  conference. 

The  conference  had  an  attendance  of  125  scientists;  and  50  organiza¬ 
tions  were  represented.  Speakers  and  panelists  came  from  George 
Washington  University,  Harvard  University,  the  National  Bureau  of 
Standards,  the  National  Institutes  of  Health,  North  Atlantic  Treaty  Organi¬ 
zation,  North  Carolina  State  University  at  Raleigh,  Phillips  Petroleum 
Company,  Stanford  University,  University  of  California  at  Los  Angeles, 
University  of  Chicago,  University  of  Georgia,  University  of  Michigan, 
University  of  North  Carolina  at  Chapel  Hill,  University  of  Wisconsin, 
Virginia  Polytechnic  Institute  and  thirteen  Army  facilities. 

The  Chairman  wishes  to  express  his  appreciation  to  his  Advisory 
Committee  (Joseph  Cameron,  F.  G.  Dressel,  Walter  D.  Foster,  Bernard 
Greenberg,  Boyd  Harshbarger,  J,  S.  Hunter,  H.  L.  Lucas,  Jr.,  Clifford 
Maloney  and  Henry  B.  Mann)  for  their  assistance  in  formulating  the  pro¬ 
gram  and  selecting  the  invited  speakers. 


Frank  E.  Grubbs 
Conference  Chairman 
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TWELFTH  CONFERENCE  ON  THE  DESIGN  OF  EXPERIMENTS 
IN  ARMY  RESEARCH,  DEVELOPMENT  AND  TESTING 

19-21  October  1966 

Wednesday,  19  October 

0900-0930  REGISTRATION  -  Administration  Building,  Main  Foyer 

0930-1000  OPENING  OF  THE  CONFERENCE  -  Admin.  Bldg.  , 

Green  Auditorium 

John  Wheeler,  Chairman  on  Local  Ar rangements , 
Harry  Diamond  Laboratories,  Washington,  D.  C. 

WELCOME 

Dr.  Allen  V.  Astin,  Director 
National  Bureau  of  Standards 


Colonel  M.  S.  Hochmuth,  Commanding  Officer 
Harry  Diamond  Laboratories 

1000-1230  GENERAL  SESSION  I,  Green  Auditorium 

Chairman:  Professor  Boyd  Harshbarger,  Department  of 
Statistics,  Virginia  Polytechnic  Institute ,  Blacksburg,  Va. 

OPERATIONS  RESEARCH 

Professor  Brian  W.  Conolly,  North  Atlantic  Treaty 
Organization,  Saclant  ASW  Research  Centre 

STATISTICS  AS  A  DIAGNOSTIC  TOOL  IN  DATA  ANALYSIS 
Dr.  John  Mandel,  Materials  Evaluation  Laboratory, 
National  Bureau  of  Standards,  Gaithersburg,  Maryland 

1230-1330  LUNCH 


Technical  Sessions  I  and  II  will  start  at  1330  and  run  to  1500.  After 
a  break  Technical  Sessions  III  and  IV  will  convene  at  1  530  and  end  at  1700. 
The  social  hour  will  begin  at  1730.  The  banquet  is  scheduled  for  1830. 
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1330-1500  TECHNICAL  SESSION  I  -  Lecture  Room  A 

Chairman;  Rf.rtT.o-.ry_  Harry  Diamond  Laberatoric s , 
Washington,  D.  C. 

STEPWISE  MULTIPLE  REGRESSION  STATISTICAL  THEORY 
AND  COMPUTER  PROGRAM  DESCRIPTION 

Harold  J.  Breaux,  Ballistic  Research  Laboratories, 
Aberdeen  Proving  Ground,  Maryland 


ESTIMATION  OF  ERROR  RATES  IN  DISCRIMINANT 
ANALYSIS 

Peter  A.  Lachenbruch  and  M.  Ray  Mickey,  University  of 
North  Carolina  at  Chapel  Hill,  North  Carolina  and 
University  of  California,  Los  Angeles,  California 
Representing  the  Army  Research  Office -Durham 

1330-1500  TECHNICAL  SESSION  II  -  Lecture  Room  B 

Chairman:  Henry  A.  Dihm,  U.  S.  Army  Missile  Command, 
Redstone  Arsenal,  Alabama 

SOME  STATISTICAL  APPLICATIONS  IN  THE  TESTING  OF 
MILITARY  RUBBER  PRODUCTS 

Emil  H.  Jebe,  Willow  Run  Laboratories,  Institute  of 
Science  and  Technology,  The  University  of  Michigan. 
Representing  the  U.  S.  Army  Tank-Automotive  Center, 
Warren,  Michigan 

A  STATISTICAL  ANALYSIS  OF  PROVISIONING  PROCESSES 
ON  FOUR  ARMY  MISSILE  SYSTEMS 

Robert  G.  Provost,  U.  S,  Army  Missile  Command, 
Redstone  Arsenal,  Alabama 

1500-1530  BREAK 
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1530-1700  TECHNICAL  SESSION  III  -  Lecture  Room  A 


Chairman:  Henry  Ellner,  Directorate  for  Quality  Assurance , 
U.  S.  Army  Edgewood  Arsenal,  Edgewood  Arsenal,  Md. 
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OPTIMAL  ECONOMY  IN  PLANNING  EXPERIMENTS 

Regina  C.  Elandt-Johnson,  University  of  North  Carolina 
at  Chapel  Hill,  North  Carolina.  Representing  the  Army 
Research  Office -Durham. 
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TECHNICAL  SESSION  III  (continued) 


ON  A  CLASS  OF  NONPARAMETRIC  TESTS  FOR  MANOVA 
TM  TWOWAY  LAYOUTS 

Pranab  Kumar  Sen,  University  of  North  Carolina  at 
Chapel  Hill,  North  Carolina  and  the  University  of 
Calcutta,  Representing  the  Army  Research  Office -Durham 

1530-1700  TECHNICAL  SESSION  IV  -  Lecture  Room  B 

Chairman:  David  Hogben,  Statistical  Engineering  Laboratory, 
National  Bureau  of  Standards,  Gaithersburg,  Maryland 

TESTS  FOR  OUTLIERS 

H.  A.  David,  University  of  North  Carolina  at  Chapel  Hill, 
North  Carolina.  Representing  the  Army  Research  Office- 
Durham 

THE  PROBABILITY  OF  SURVIVAL  OF  A  SUBTERRANEAN 
TARGET  UNDER  INTENSIVE  ATTACK 

Bernard  Harris,  Herman  F.  Karreman,  and  J.  Barkley 
Rosser,  Mathematics  Research  Center,  U.  S,  Army, 
University  of  Wisconsin,  Madison,  Wisconsin 

1730-1830  SOCIAL  HOUR  -  Country  Squire  Room,  Washingtonian  Country 

Club,  Gaithersburg,  Maryland 

1830-  BANQUET  -  (As  above) 

Presentation  of  the  Samuel  S,  Wilks  Memorial  Award 


Thursday,  20  October 


Technical  Sessions  V  and  VI  will  run  from  0900-1020.  After  the  break 
Technical  Session  VII  and  Clinical  Session  A  will  start  at  1050  and  run  to 
1230.  After  lunch  Technical  Session  VIII  and  Clinical  Session  B  will 
convene  at  1330  and  end  at  1520,  After  a  half  hour  break  General  Session  2 
is  scheduled  for  1550  to  1700. 


0900-1020  TECHNICAL  SESSION  V  -  Lecture  Room  A 


Chairman;  Selig  Starr,  Mathematics  Branch,  Office  of  the 
Chief  of  Research  and  Development,  Washington,  D.  C. 


TECHNICAL  SESSION  V  (continued) 


SINGLE  DEGREE  OF  FREEDOM  ORTHOGONAL  COMPONENTS 
OF  A  FACTOR  AT  2k  LEVELS  IN  TERMS  OF  LINEAR  COM¬ 
BINATIONS  OF  THE  2K  CONTRASTS  OF  K  FACTORS  AT  2 
LEVELS 

Joseph  Weinstein,  Electronic  Components  Laboratory, 

U.  S.  Army  Electronics  Command,  Fort  Monmouth,  N.  J. 

CONDITIONAL  EFFECTS  AND  INTERACTION  IN  SYMMET¬ 
RICAL  FACTORIAL  CONFOUNDING  WITH  APPLICATIONS 
TO  BIOLOGY 

N.  R,  Bohidar,  Biomathematics  Division,  Fort  Detrick, 
Frederick,  Maryland 

0900-1020  TECHNICAL  SESSION  VI  -  Lecture  Boom  B 

Chairman:  James  B.  Duff,  U.  S.  Army  Engineering  Research 
and  Development  Laboratory,  Fort  Belvoir,  Virginia 

THE  NEGATIVE  BINOMIAL  DISTRIBUTION  APPLIED  TO 
ATMOSPHERIC  PARAMETERS 

Oskar  M.  Essenwanger,  U.  S.  Army  Missile  Command, 
Redstone  Arsenal,  Alabama 

TRIAL  VARIABILITY  INTERPRETED  AS  DIFFERENCES  IN 
TRANSLATION  OR  ROTATION  IN  FUNCTION  ANALYSIS  OF 
VARIANCE 

Walter  D.  Foster,  U.  S.  Army  Biological  Laboratories, 

Fort  Detrick,  Frederick,  Maryland 

1020-1050  BREAK 

1050-1230  TECHNICAL  SESSION  VII  -  Lecture  Room  A 

Chairman:  A.  Bulfinch,  U.  S.  Army  Munitions  Command, 
Picatinny  Arsenal,  Dover,  New  Jersey 

SIMULATION  OF  BIO -CELLULAR  ANIMAL  SYSTEMS 

George  I.  Lavin,  Terminal  Ballistic  Laboratory,  Ballistic 
Research  Laboratories ,  Aberdeen  Proving  Ground, 

Maryland 
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TECHNICAL  SESSION  VII  (continued) 


A  METHOD  FOR  ADJUSTING  FOR  PARTICLE  SIZE  IN  THE 
X-RAY  FLOURESCENCE  ANALYSIS  OF  A  MULTICOMPO- 
'  NENT  MIXTURE 

R.  H.  Myers,  Virginia  Polytechnic  Institute,  Blacksburg, 

*  Virginia,  and  Donald  E.  Womeldorph,  Phillips  Petroleum 

Company.  Representing  the  Army  Research  Office-Durham 

1050-1230  CLINICAL  SESSION  A  -  Lecture  Room  B 

Chairman:  Fred  Frishman,  Mathematics  Branch,  Office, 
Chief  of  Research  and  Development,  Washington,  D.  C. 

Panelists: 

Mr.  O.  P.  Bruno,  Surveillance  &  Reliability  Laboratory, 

U.  S.  Army  Ballistic  Research  Laboratories,  Aberdeen 
Proving  Ground,  Maryland 

Professor  A,  C.  Cohen,  Jr.  ,  Institute  of  Statistics, 
University  of  Georgia,  Athens,  Georgia 

lV  Professor  Boyd  Harshbarger,  Statistical  Laboratory, 

Virginia  Polytechnic  Institute,  Blacksburg,  Virginia 

•  Dr.  Joan  R.  Rosenblatt,  Statistical  Engineering  Laboratory, 

National  Bureau  of  Standards,  Gaithersburg,  Maryland 

Professor  Herbert  Solomon,  George  Washington  University 
Washington,  D.  C.  and  Stanford  University,  Stanford, 
California 

THE  PROBLEM  OF  DETERMINING  THE  CONFIDENCE 
LEVEL  FOR  SOME  TIME  INDEPENDENT  SYSTEM 
,  RELIABILITY  ESTIMATES  WHEN  ATTRIBUTE  DATA  FOR 

THE  SYSTEM  SUB -COMPONENTS  ARE  GIVEN  (A  PROPOSED 
SOLUTION  AND  APPROXIMATION  TECHNIQUE) 

Eugene  F.  Dutoit,  Picatinny  Arsenal,  Dover,  New  Jersey 

STATISTICS,  PROBABILITY,  AND  DETERMINISM  IN  A 
RELIABILITY  IMPROVEMENT  PROGRAM 

Woodie  R.  Jenkins,  Jr.  ,  National  Range  Operations, 

,  White  Sands  Missile  Range,  New  Mexico 

1230-1330  LUNCH 
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1330-1520  TECHNICAL  SESSION  VIII  -  Lecture  Room*  A 

Chairman:  Cyrus  Martin,  U.  S.  Army  Engineering  Research 
and  Development  Laboratory,  Fort  Belvoir,  Virginia 

A  COMPUTERIZED  PROCEDURE  FOR  WRITING  MATHE¬ 
MATICAL  MODELS  FOR  SYSTEMS  RELIABILITY 
Anthony  J.  Ricciardi  and  John  G.  Mardo,  Nuclear 
Reliability  Division,  Picatinny  Arsenal,  Dover,  N.  J. 


1330-1520  CLINICAL  SESSION  B  -  Lecture  Room  B 

Chairman:  Albert  Parks,  Harry  Diamond  Laboratories, 

Washington,  D.  C. 

Panelists: 

Professor  Bernard  Greenberg,  Department  of  Biostatis- 
tics,  University  of  North  Carolina,  Chapel  Hill,  N.  C. 

Dr.  Frank  E.  Grubbs,  Ballistic  Research  Laboratories, 

Aberdeen  Proving  Ground,  Maryland 

/ 

Professor  William  Kruskal,  Department  of  Statistics, 

The  University  of  Chicago,  Chicago,  Illinois 

• 

Professor  H.  L.  Lucas,  Jr.,  Institute  of  Statistics, 

North  Carolina  State  College,  Raleigh,  North  Carolina 

Dr.  Henry  B.  Mann,  Mathematics  Research  Center, 
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OPERATIONS  RESEARCH 


Prnfo««nr  Briar.  V/.  Goi'iuily-: 

North  Atlantic  Treaty  Organization 
Saclant  ASW  Research  Center 

INTRODUCTION. 

1.  It  is  a  privilege  and  a  pleasure  to  be  invited  to  make  a  presentation 
on  Operational  Research  at  a  specialist  statistical  conference.  Those 
individuals  who  choose  to  make  Operational  Research  their  profession 
come  from  the  ranks  of  engineers,  physicists,  chemists,  mathematicians 
as  well  as  statisticians.  All  have  a  contribution  to  make  to  Operational 
Research.  I  myself,  for  example,  am  a  mathematician  by  basic  training, 
with  a  pronounced  interest  in  obtaining  practical  and  verifiable  solutions  to 
real  life  problems, 

2.  The  name  Operational  Research  is  itself  perhaps  not  a  very  good 
description  of  the  type  of  activity  that  0.  R.  workers  usually  undertake.  I  do 
not  propose  to  be  so  controversial  as  to  suggest  an  alternative.  My  theme 

is  rather  to  suggest  that,  as  it  has  developed,  modern  0.  R.  has  come  to 
depend  more  and  more  heavily  on  the  science  and  techniques  of  statistics  and 
probability  theory.  And  it  is  not  difficult  to  see  why  this  is  so. 

3.  In  O.  R.  we  are  usually  concerned  with  studying  the  workings  of  a 
complex  system  or  process  such  as  the  manufacture  of  an  automobile;  the 
organization  of  an  airport;  the  routing  of  city  traffic;  a  telephone  exchange; 
the  detection,  classification  and  destruction  of  an  enemy  target.  If  we  like 
to  call  these  systems  or  processes  "operations",  and  the  study  we  make  of 
them  "research",  then  we  arrive  at  the  name  Operations  Research  by  which 
O.  R.  is  designated  in  the  U.  S.  The  fact  that  0,  R.  is  called  "Operational 
Research"  in  Europe  is  presumably  by  analogy  with  our  practice  to  call 
research  in  physics  physical  research,  and  research  in  mathematics 
mathematical  research. 

4.  The  objective  of  0.  R.  studies  is  normally  to  discover  how  to  optimize 
in  some  sense  the  output  of  the  process;  e,  g.  produce  an  adequate  automo¬ 
bile  at  a  minimal  cost;  achieve  an  airport  organization  which  maximizes 
passenger  flow  with  a  minimum  of  incovenience  and  the  best  employment  of 
facilities;  maximize  the  probability  of  destruction  of  the  enemy  target.  In 
order  to  do  this  we  have  to  try  to  understand  the  structure  of  the  process. 
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5.  The  complex  processes  which  O.  R.  investigates  are  normally 

decomposable  into  a  number  of  subsidiary  on  each  cf  which 

the  ultimate  output  depends.  If  one  regards  these  as  parameters  of  the 
system  as  a  whole,  then  the  study  consists  first  in  determining  their  inter¬ 
relation,  and  the  way  they  affect  the  output.  This  leads  to  the  creation  of 

a  more  or  less  mathematical  model  -  a  set  of  equations  which  characterises 
the  process.  If  the  model  is  verified  in  the  sense  that  it  can  be  used  to 
predict  measurable  outputs,  then  the  analysis  of  the  process  and  its  opti¬ 
mization  reduces  to  the  application  of  appropriate  mathematical  techniques 
to  the  model. 

6.  One  reason  why  modern  O,  R.  has  come  to  be  heavily  dependent  on 
probability  and  statistics  is  the  greater  recognition  of  the  need  to  assess 

the  effect  of  chance  on  the  outcome  of  a  process;  rather  than  to  work  through¬ 
out  with  average  values.  Nowadays  we  are  interested  in  the  probability 
distributions  of  the  outcomes  of  the  subsidiary  processes  in  order  to  discover 
the  probability  distribution  of  the  overall  outcome.  Under  these  circum¬ 
stances  we  have  to  deal  with  stochastic  processes  and  our  analysis  depends 
on  the  specialized  techniques  developed  by  the  experts. 

7.  I  think  that  in  fact  O.  R.  and  statistics  have  much  to  offer  each  other. 
Erlang  was  a  Danish  engineer  and  an  O.  R.  worker  whose  interest  was  the 
Danish  telephone  service.  His  work  in  the  early  20th  century  founded  queue¬ 
ing  theory  which  in  all  its  increasing  complexity  is  the  subject  of  many 
research  papers  published  in  both  statistical  and  O,  R.  research  periodicals. 
Those  who  are  concerned  with  military  exercies  know  that  one  has  to  deal 
with  experiments  whose  design  cannot  be  altogether  controlled,  that  the 
samples  are  small,  and  the  variables  many;  a  situation  shocking  to  a  classi¬ 
cal  statistician,  but  a  challenge, 

8.  During  the  remainder  of  this  presentation  I  intend  to  be  more 
specific.  In  order  to  Illustrate  my  thesis  of  the  statistical  interest  which 
is  to  be  found  in  O,  R,  studies  and  the  dependence  of  the  analysis  on  statis¬ 
tical  expertise  I  shall  describe  two  problems  from  a  military  O.R.  context, 
which  I  hope  you  will  find  entertaining. 

PROBLEM  1. 

9.  During  anti-submarine  operations  there  inevitably  occur  events 
which  have  a  nuisance  value,  and  which  one  would  like  to  eliminate.  The 
elimination  is  partly  a  matter  of  equipment  design,  and  partly  of  training 
in  its  use. 

10.  For  the  purposes  of  this  presentation  I  am  concerned  only  with 
finding  a  simple  stochastic  process  which  describes  the  occurrence  of  the 
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events  in  time  in  the  hope  that  such  a  description  may  throw  light  on  the 
basic  phenomenon.  I  have  no  vartimA  ■'••cords  of  the  events,  tliuugh  I  know 
they  have  always  occurred,  I  am  therefore  dependent  on  naval  exercises 
fo;  dat-i. 


11.  Suppose,  then,  that  I  have  obtained  from  the  records  of  one  ship 

during  a  recent  exercise  the  times  tn  of  occurrence  of  the  events  which  I 
shall  denote  by  £n(n>  1).  1  measure  time  from  the  beginning  of  the  exercise. 

An  immediate  difficulty  arises  out  of  the  fortunate  fact  that  the  En  do  not 
occur  at  a  tremendously  high  rate.  Three  per  day  might  be  a  typical  average 
taken  over  all  ships.  Exeicises  of  the  right  sort  do  not  take  place  frequently 
and,  when  they  do,  they  are  of  a  limited  duration.  Thus,  typically,  at  the 
end  of  a  week  I  might  have  a  few  tens  of  events  for  each  ship.  At  the  begin  - 
nig  I  want  to  consider  each  ship's  records  separately,  so  my  sample  is  not 
very  great. 

12.  Adopting  the  good  practice  of  making  a  simple  initial  hypothesis 
1  look  at  the  time  series  (tn)  for  each  ship  and  ask  if  there  are  indications 
that  the  events  (which  a  priori  might  be  thought  of  as  having  random  origin) 
occur  in  a  Poisson  stream.  The  answer  is  that  they  do  not  appear  to  do  so, 
but  rather  that  in  all  cases  there  is  evidence  of  cluster  (a  preponderance  of 
short  inter-event  time  intervals  as  compared  with  a  Poisson  stream  with 

the  same  mean).  Moreover  the  mean  intervals  of  the  event  distributions  seem 
to  be  quite  different  from  each  other,  and  1  do  not  find  evidence  which  supports 
the  hypothesis  that  the  En  for  ship  A  could  be  generated  by  adjustment  of  the 
mean  from  the  stream  of  En  for  ship  B. 

13.  Since  there  appears  to  be  clustering  I  next  ask  myself  if  a  particular 
stream  En  could  have  been  generated  by  a  contagious  process,  and  for  this 
purpose  I  choose  a  Pdlya  process  defined  in  the  following  form*  Suppose  that 
the  process  begins  at  time  t=0  and  that  no  event  E  occurs  at  that  time.  The 
instantaneous  probability  that  the  (n+l)^  Polya  event  takes  place  in  the  small 
interval  (t,  t+dt)  is  given  by  Pn+i(t)dt  where 

Pn+l^  =  (l+aXt)^  * 


The  parameters  X  and  a  are  supposed  to  be  real,  X  is  positive  and  a  non¬ 
negative.  There  is  apparently  no  other  restriction  on  a,  though  I  shall  make 
some  more  remarks  on  that  subject  later.  When  a  is  zero  the  Po'lya  process 
clearly  becomes  a  Poisson  process  with  mean  interval  length  l/X  time  units. 

14.  I  now  give  without  proof  a  few  key  theoretical  results  of  a  Pdlya 
process  as  defined.  Let  Pn(T)  be  the  probability  that  exactly  n  Pdlya  events 
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occur  during  the  interval  (0,T)  and  define  p  (0)=1.  The  generating  function 
P(x,T)  of  Pn(T)  is  then  ° 

(14.1)  P(x ,  T )  =  [  1+aX  T(l-x)]  “l/a  . 


The  mean  and  variance  of  n  are 
(14.2)  E(n)  =  \  T. 

(14.  3)  Var(n)  =  a  \  2T2  +  X  T  . 


The  exercise  data  give  me  a  series  of  time  intervals  between  eventB. 
likelihood  of  a  series  of  n  events  occurring  at  the  instants  t^,  t^,  .  .  .  , 


(14.4) 


Pn^l*  V  tn)  = 


n-1 

n 

i 


(l+a  X  tn) 


(1+ra) 

'  n+l/a 


The 

t  is 
n 


The  maximum  likelihood  estimator  of  X  is  simply  n/t  ,  but  that  of  a  is 
more  complicated. 

15.  The  distribution  of  the  intervals  between  events  is  of  particular 

interest.  It  turns  out  that,  for  any  n,  the  p.d.  f,  g  (t  )  of  the  interval 

th  th  ^ 

between  the  n  “  and  the  (n+1)  events,  whenever  the  first  n  events  took 

place,  is 


/ 


4 


(15.  1) 


gjr)  = 


(l+a  Xt) 


7^75 


and  is  independent  of  n.  The  interpretation  of  this  is  that  if  we  generate 
a  lot  of  Polya  processes,  each  having  the  same  parameters  a  and  X  ,  and 
then  examine  the  time  intervals  between,  say,  the  second  and  third  events 
in  all  the  processes,  we  should  find  that  they  all  are  distributed  according 
to  (15.  1). 

16.  The  moment  p  about  zero  of  an  interval  between  two  given 
events  is 


4 


(16.1) 


= 


\  1 


■I1-**; 


Th!  .  J?y  M  °nly  lf  n°ne  0f  the  terms  in  the  denominator  is 
infinit/lf  ^  fsl  *he  “terval  length  distribution  has  all  its  moments 
infinite  if  a  «  the  first  moment  is  finite,  but  none  of  the  higher 
moments  is.  • 

17  I  return  to  my  fundamental  problem.  I  have  an  observed  series 
of  events  and  I  want  to  make  statements  about  the  hypothesis  that  they  are 

ET *  *°  *  P°lya  Pr0”S*'  H°"  *>  1  — —  S  -  *  from* 

18.  In  order  to  throw  light  on  this  problem  sequences  of  Polya  intervals 
were  generated,  each  having  the  same  a  and  X.  The  idea  was  to  compare 

L  th*  A  °btai"ed  by  various  mean*  with  their  known  values. 

In  fact  the  problem  of  estimation  remains  open,  but  some  features  of  Pdlya 
processes  have  been  revealed  which  were  a  surprise  to  me.  V 

19*  Th®  digital  computer  generation  of  the  Prflya  intervals  was  carried 
out  as  follows.  Suppose  that  n  events  have  been  generated  and  that  thev 
occurred  at  time.  V  ^  . .  .tn.  We  require  the  probability  * 

bn+l  ^n+l^l '  *2 ’ '  *  ’f'n^dtn+l  t^e  (n+l)tb  event  occurs  in  the  interval 

^n+1'  tn+l+dtn+l^‘  glven  that  the  first  n  occurred  at  times  t,  ,t  ,. .  ,t 
Clearly,  1  Z  n 

Pn' i  *2  n 

La; r„«. in  4t'  •*y<  -* 


A(l+na)(l+a\t  )n+1/a 


(1+aXt^+aXT) 


n+l+l/a 


The  conditional  probability  that  the  (n+l)th  interval  is  less  than  T  is; 
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V  *  '•  -7 


n+1* 


V  2'  *  ’ 


I  l+a\t  +aAT  I 

*■  n  J 


To  obtain  the  intervals  one  generates  a  sequence  of  random  numbert.  r 
independently  and  uniformly  distributed  on  0  <  <  1 ,  and  then  solves0 


(19.4) 


n+1 


for  r 


n+1 

EXPERIMENT  1  . 


[1+aAt  1 

- 2 - 

1+aAt  +aAr 

n  n+1  «* 


n+l/a 


20,  I  am  now  going  to  describe  briefly  some  of  the  experiments 
which  were  carried  out.  For  the  first  we  generated  5  independent  sequences 
of  1000  Polya  intervals,  for  each  of  which  we  assigned  A=4/30,  a  . 

This  was  to  give  l/A  (l-a),  the  mean  interval  length,  the  value  10  which 
corresponded  with  observation,  The  means  and  standard  deviations  of  the 
interval  lengths  were  as  follows. 


Means  and  Standard  Deviations  of  5  Independent 
Sequences  of  1000  Polya  Intervals 
with  the  same  Parameters 


Sequence  No, 

Mean 

Standard  Deviation 

1 

2,708 

3.  164 

2 

21.804 

22.964 

3 

6,428 

6.874 

4 

7.  349 

8.  304 

5 

9.519 

10. 290 

This  Table  was  the  first 

surprise. 

We  expected  each  sequence  to  ! 

a  mean  and  standard  deviation  reasonably  close  to  the  theoretical  values  of 
10  and  10 V~2  respectively.  The  first  sequence  was  also  "looked  at"  just 
after  the  100^  event  and  the  means  and  standard  deviation  were  found  to  be 


2.450  and  2,615  respectively.  Thus  it  appeared  that  the  processes  were 
settling  down  to  a  steady  state  quite  rapidly,  but  a  steady  state  which  could 
be  vastly  different  from  one  process  to  another,  even  thoiigh  the  parameters 


were  the  same.  On  the  face  of  it,  then,  it  appears  that  estimation  of  a 
and  X  based  on  a  perfectly  valid  sequence  might  well  give  completely 

C  .  _ _  _  ll  i  1  a  ■  • 

— --- *  —  ~  ~  uvn*  wiuov  v  umiucu  iiujiu  anuuicr  cqudliy  Veil  1  a  Sequence* 

21.  The  measure  used  of  the  mean  interval  after  the  event  at  time 

t  was 
n 

(21 . 1)  x  e  t  /n  . 

n' 


The  sampling  density  function  of  x  is: 

-1  n“l 

(nX)  xn  .  n  (l+ra) 

(21.2)  r=l 

(n-1).'  {l+naXx)n+1/a 

and  the  expectation  and  variance  of  x  are  respectively 

E(x)  =  l/X  (1-a)  ; 

(21. 3) 

Var(x)  a  (1-a+an)  /  [nX2(l-a)2(l-2a)]  , 


Now  the  mean  of  the  five  sequence  means  is  9.  561  and  the  standard  devia¬ 
tion  is  about  7.  The  variation  of  the  sequence  means  is  thus  less  surpris¬ 
ing,  but  no  less  discomforting. 

22.  The  theoretical  reason  for  the  stability  of  Polya  sequences  about 
widely  differing  means  seems  to  be  that  the  whole  pattern  of  a  sequence  is 
on  the  average  governed  by  the  first  interval.  This  can  be  seenbv  con- 
sidering  the  conditional  expected  value  E^/t^t^  ,  tft)  of  the  (n41)th  interval 

r  ,  given  tho  times  t^.t^- . .  t^  of  occurrence  of  the  first  n  events.  Then 


(22.  1) 


E(f  /tj ,  t . 


1+aXt 


. .  .t  ) 
r 


X  { 1  +a(  n  - 1 ) } 


(3  (t  ) 
n*  n' 


Thus  the  conditional  expected  value  of  the  second  interval  is  (l+aXt.)/X. 
If  tj  is  greater  than  its  expected  value  l/X  ( 1-a)  then  1 
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1 

A(i-a) 


E(tA,)  >  ~  +  >  / -f 

1  A  Ay  1  -rl; 

i.e.  the  second  interval  also  tends  to  be  greater  than  its  expected  value. 

And  so  on  for  all  successive  intervals. 

EXPERIMENT  2. 

2  3.  The  second  experiment  was  an  extension  of  the  first.  500 
independent  sequences  of  500  Polya  intervals  were  generated,  all  having 
the  same  parameters  a=^-,  A  =  4/i0.  We  were  looking  for  something 
constant  in  all  the  sequences.  Since  the  value  of  the  instantaneous 
probability  density  of  an  event,  just  after  the  generation  of  the  nth  event 
is 

^n+l^n*  = 

and  since  we  were  measuring  the  mean  interval  length  by  the  estimator 
tn/n  we  felt  that  the  product  ?n+j(*n)  *  (t^/ n)  should  be  constant  (l)  for 

long  enough  sequences.  This  turns  out  to  be  the  case.  The  table  [See 
Table  1  near  the  end  of  this  article.  ]  shows  some  typical  values  correspond¬ 
ing  to  the  500th  event  in  each  sequence.  The  products  are  all  very  close 
to  the  theoretical  value  l/A,  in  this  case  30/4  =  7.  5.  Unfortunately  this 
constancy  is  not  of  much  practical  use.  It  does  provide  some  feeling  that 
the  computer  program  is  working  as  it  should, 

24.  It  was  also  decided  to  group  all  the  250  000  intervals  into  a  histo¬ 
gram  which  is  shown  in  Table  2.  If  we  m-ike  the  hypothesis  that  this 
represents  a  random  sample  from  the  event-independent  distribution  of 
Pdlya  intervals 

P^(dT  )  =  Ad-r/(  l+aAT)^+1^a 

the  mean  and  standard  deviations  are  theoretically  10  and  lo/T,  and  the 
observed  values  look  close.  But  are  they  close  enough  on  the  basis  of 
250  000  observations?  I  cannot  answer  that  question  at  the  moment. 

25.  The  observed  frequencies  in  cells  of  one  time  unit  long  are 
tabulated  in  the  column  "observed",  while  the  "expected"  frequencies  were 
calculated  on  the  basis  of  the  event-independent  distribution.  The  last 
column  gives  \^.  Overall  this  is  enormous.  There  is  a  deficiency  of 


A(l +an) 
1+aAt 

n 
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observed  short  intervals  and  an  excess  of  long  ones.  There  are  also 
other  oddities.  A  Poisson  process  with  the  same  mean  (9.866.  ..)  would 
give  a  frequency  of  about  24  000  in  the  (0,1)  range,  so  at  least  there  is 
evidence  of  the  clustering  one  expects  in  a  Polya  process.  1  think  perhapB 
that  the  sample  cannot  be  considered  random  and  independent,  and  this 
may  be  the  explanation  for  the  poor  agreement.  We  also  produced  a 
histogram  of  the  500  process  means  and  this  is  available  if  anyone  is 
interested. 

EXPERIMENT  3. 

26.  Our  faith  in  the  theory,  of  the  event  independent  interval  distri¬ 
bution  was  a  little  shaken  by  the  previous  experiment.  The  next  experiment 
was  conducted  in  order  to  restore  confidence.  1000  independent  Polya 
process  (with  X  =  4/30  and  a  =  as  usual)  were  generated  as  far  as  the 
12*^  interval.  For  each  process  the  lengths  of  the  41*1  and  of  the  1  l**1 
intervals  were  grouped  into  histograms.  These  are  shown  in  Tables  3  and 
4.  We  did  not  instruct  the  computer  to  group  cells  with  low  frequencies, 
but  even  so  there  is  satisfactory  behaviour  according  to  the  hypothesis  of 
the  event  independent  distribution. 

EXPERIMENT  4. 

27.  We  have  carried  out  various  other  experiments.  The  last  which 

I  will  mention  concerns  the  correlation  between  intervals  in  a  Polya  process. 
Theoretically  we  appeared  to  find  that  the  correlation  between  any  pair  of 
intervals  is  a,  provided  that  a  <  j  .  For  a  >  ^  there  is  trouble  over  the 
convergence  of  the  integrals  for  the  second  moments. 

28.  Table  5  concerns  sequences  of  Polya  intervals  for  fixed  X  =  4/30 
and  a  varying  from  0.  1  to  0,  9.  For  each  a,  1000  sequences  were  generated 
and  the  Table  gives  the  mean  and  standard  deviation  of  the  first  and  tenth 
intervals,  the  mean  value  of  the  product  of  these  intervals  ("prod")>  and 
finally  the  correlation  coefficient  calculated  from  observed  values. 

29.  Without  information  on  the  sampling  distribution  of  the  correlation 
coefficient  it  is  difficult  to  make  meaningful  statements  about  these  results. 
There  are  signs  of  agreement  between  theory  and  observation  for  a  =  0.  1, 

0.2  and  0.  3  .  For  a  >  0.  5  the  second  moments  do  not  exist,  in  theory,  and 
a  certain  wildness  will  be  observed  in  the  results. 

30.  This  concludes  my  description  of  some  experiments  with  Polya 
processes.  We  have  subsequently  formed  the  opinion  that  the  Polya  process 
is  not  a  good  model  for  the  natural  phenomenon,  but  we  do  feel  that  it  has 
been  interesting  to  study  the  behaviour  of  the  processes.  I  feel  there  is 
room  for  a  good  deal  more  statistical  investigation  of  these  processes. 

') 


For  example  the  problems  of  parameter  estimation  and  sampling  distri¬ 
butions  are  still  open,  not  to  mention  the  interpretation  of  the  apparent 
i cut* ic Lion  on  a  which  results  from  the  non-existence  of  some  of  the 
moments  for  certain  values.  Perhaps  some  of  you  know  the  answers  to 
these  questions,  and  if  I.  as  a  representative  of  Operational  Research, 
have  called  your  attention  to  a  typical  O.  R.  investigation  where  expert 
statisticai  advice  is  needed  1  have  succeeded  in  my  objective. 

PROBLEM  2. 

31.  My  second  example  concerns  a  tactical  problem.  We  were 
interested  in  a  situation  in  which  a  tactical  unit  has  the  task  of  penetrat¬ 
ing  a  barrier  patrolled  by  opposing  forces.  For  the  purposes  of.  the 
example,  the  barrier  forces  will  be  regarded  as  a  point  which  moves 
according  to  the  general  rules  along  a  line  perpendicular  to  the  general 
expected  direction  of  penetration  of  the  opposing  forces. 

32.  The  situation  is  illustrated  in  the  next  figure. 


The  area  of  interest  is  the  rectangle  ABCD.  The  line  EF  is  patrolled  by 
the  barrier  forces  S.  Its  opponent  P  has  the  task  of  moving  from  some 
point  on  the  boundary  AB  to  CD.  That  is  to  say,  P  wants  to  traverse  EF 
without  being  intercepted  by  S. 

33.  S,  the  intercepter,  is  provided  with  exact  information  about  the 
whereabouts  of  P  either 

(a)  continuously ; 

(b)  at  regularly  spaced  intervals; 

(c)  at  random  intervals  having  a  negative  exponential 
distribution. 

P,  the  penetrator,  is  supposed  to  have  a  number  of  penetration  strategies, 
for  instance: 
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(a)  a  straight  unvarying  track  from  A  to  C ; 

(b)  a  track  composed  of  a  straight  portion  and  one  change 
of  course  at  an  arbitrarily  selected  moment  before 
reaching  EF , 

(c)  a  random  zig-zag. 

Strategies  are  also  postulated  for  S.  It  can,  for  example, 

(a)  Predict  the  track  of  P  on  the  basis  of  the  most  recent 
information,  and  strive  to  reach  the  point  of  intersection 
of  that  track  with  EF  in  order  to  intercept  P. 

(b)  Attempt  to  equate  its  x-coordinate  with  the  last  reported 
coordinate  of  P. 

(c)  Attempt  to  reach  a  point  such  that,  whatever  P  does,  the 
interception  time  is  a  minimum. 

34.  With  three  information  categories,  three  strategies  for  P  and 
three  strategies  for  S,  we  have  a  total  of  27  combinations  to  study.  What 
is  a  suitable  criterion  of  effectiveness?  One  obvious  choice  which  will  be 
considered  here  is  the  shortest  distance  between  P  and  S  during  an 
attempted  penetration.  If  necessary  this  can  later  be  translated  into 
probabilities  of  detection  and  kill. 

35.  We  found  in  fact  that  the  major  part  of  this  study  could  be  carried 
out  analytically.  The  combination  of  random  information  with  any  of  the 
other  possibilities  defied  analysis,  however,  and  for  these  cases  we 
resorted  to  a  digital  computer  simulation.  Now  it  is  particularly  impor¬ 
tant  when  employing  digital  computer  simulation  to  invoke  a  check  on  what 
one  is  doing.  What,  then,  would  be  a  suitable  check? 

36.  I  would  like  you  to  consider  the  situation  in  which  at  time  t*=0 

S  is  at  E  and  P  at  A.  P's  strategy  is  pursue  the  diagonal  track  AC,  while 
S,  when  it  receives  information  as  to  P's  position,  attempts  to  equate  its 
x-coordinate  with  the  last  reported  x-coordinate  of  P.  Assuming  that  the 
distances  between  A  and  E  are  large  we  then  are  naturally  led  to  consider 
a  situation  which,  evolving  over  a  sufficiently  long  time  consists  of  a  chase  of  P  by  S. 

37.  In  projection  along  an  x-axis  parallel  to  AB,  P  moves  continuously 
from  left  to  right  at  a  speed  v,  say.  S,  when  it  receives  information  about 
the  x-coordinate  of  P,  tries  to  equate  its  own  x-coordinate  with  the  last 
reported  x-coordinate  of  P.  It  moves  with  constant  speed  u.  If  S  arrives 
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at  the  last  reported  x-coordinate  of  P  before  further  information  arrive 

it  stop 8  and  waifs  OfVi  Ar«M  O  <3  it  J-I  4  C  Z 


38.  It  turns  out  chat  one  can  obtain  theoretically  the  statistical 
distribution  of  the  distance  between  P  and  S  parallel  to  the  x-axis  at  an 
"information  instant",  assuming  a  steady  state  has  come  about.  This 
quantity  can  be  output  very  simply  from  the  computer  program  and,  if  it 
conforms  with  theory,  it  gives  a  measure  of  confidence  in  the  random 
mechanisms  which  the  computer  has  been  programmed  to  simulate.  I 
would  not  like  to  consider  the  theoretical  problem  of  the  distribution  of  the 
difference  between  the  x-coordinates  of  P  and  S  at  an  information  instant. 

39.  Let  6m  be  the  distance  measured  parallel  to  the  x-axis  between 
P  and  S  at  the  instant  when  information  is  transmitted  to  S  for  the 

_ th  ,  . 

m  time  since  toO.  With  obvious  notation,  since 


=<2m>  =  V£m-1)  +  vTm 


and 


V 

we  have 


* 


6  h  v  T 
m  m 


if  x  (2  , ) 

s  m- 1 ' 


+  u 


T  > 
m  — 


m-1 


); 


m 


=  $  .  +  (v  -  u)T 

m-1  m 


if  *.(sm  ,)  +  u  T  <x  (Z  ,) 
s  m-1  m  p'  m-1 ' 


or 


5  =  v  T  if  6  ,<uT  • 

m  m  m- 1  —  rn  ’ 


=  6  .  -  u  T  +  v  T  if  6  ,  >  u  T 


m  m- 1 


m  m  m-1 


m 


Writing  rm+1  =  &m  -  uTm+1>  we  have 


12 


(a)  a  straight  unvarying  track  from  A  to  C ; 

(b)  a  track  composed  of  a  straight  portion  and  one  change 
of  course  at  an  arbitrarily  selected  moment  before 
reaching  EF ; 

(c)  a  random  zig-zag. 

Strategies  are  also  postulated  for  S.  It  can,  for  example, 

(a)  Predict  the  track  of  P  on  the  basis  of  the  most  recent 
information,  and  strive  to  reach  the  point  of  intersection 
of  that  track  with  EF  in  order  to  intercept  P. 

(b)  Attempt  to  equate  its  x-coordinate  with  the  last  reported 
coordinate  of  P. 

(c)  Attempt  to  reach  a  paint  such  that,  whatever  P  does,  the 
interception  time  is  a  minimum. 

34.  With  three  information  categories,  three  strategies  for  P  and 
three  strategies  for  S,  we  have  a  total  of  27  combinations  to  study.  What 
is  a  suitable  criterion  of  effectiveness  ?  One  obvious  choice  which  will  be 
considered  here  is  the  shortest  distance  between  P  and  S  during  an 
attempted  penetration.  If  necessary  this  can  later  be  translated  into 
probabilities  of  detection  and  kill. 

35.  We  found  in  fact  that  the  major  part  of  this  study  could  be  carried 
out  analytically.  The  combination  of  random  information  with  any  of  the 
other  possibilities  defied  analysis,  however,  and  for  these  cases  we 
resorted  to  a  digital  computer  simulation.  Now  it  is  particularly  impor¬ 
tant  when  employing  digital  computer  simulation  to  invoke  a  check  on  what 
one  is  doing.  What,  then,  would  be  a  suitable  check? 

36.  I  would  like  you  to  consider  the  situation  in  which  at  time  t=0 

S  is  at  E  and  P  at  A.  P's  strategy  is  pursue  the  diagonal  track  AC,  while 
S,  when  it  receives  information  as  to  P's  position,  attempts  to  equate  its 
x-coordinate  with  the  last  reported  x-coordinate  of  P.  Assuming  that  the 
distances  between  A  and  E  are  large  we  then  are  naturally  led  to  consider 
a  situation  which,  evolving  over  a  sufficiently  long  time  consists  of  a  chase  of  P_by  S, 

37.  In  projection  along  an  x-axis  parallel  to  AB,  P  moves  continuously 
from  left  to  right  at  a  speed  v,  say,  S,  when  it  receives  information  about 
the  x-coordinate  of  P,  tries  to  equate  its  own  x-coordinate  with  the  last 
reported  x-coordinate  of  P.  It  moves  with  constant  speed  u.  If  S  arrives 
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at  the  last  reported  x-coordmate  of  P  before  further  information  arrives 
it  stops  and  waits.  Otherwise  it  continues. 

38.  It  turns  out  that  one  can  obtain  theoretically  the  statistical 
distribution  of  the  distance  between  P  and  S  parallel  to  the  x-axis  at  an 
"information  instant",  assuming  a  steady  state  has  come  about.  This 
quantity  can  be  output  very  simply  from  the  computer  program  and,  if  it 
conforms  with  theory,  it  gives  a  measure  of  confidence  in  the  random 
mechanisms  which  the  computer  has  been  programmed  to  simulate.  I 
would  not  like  to  consider  the  theoretical  problem  of  the  distribution  of  the 
difference  between  the  x-coordinates  of  P  and  S  at  an  information  instant. 


39.  L.et  6  be  the  distance  measured  parallel  to  the  x-axis  between 
P  and  S  at  the  instant  2  when  information  is  transmitted  to  S  for  the 
th  m 

m  time  since  t=0.  With  obvious  notation,  since 


and 


x  (2  )  =  x  (2  .)  +  v  T 

p  m  p  m-1  m 


x  (2  )=x{2  .)  if  x  (2  . )  +  u  T  >  x  (2  ,) 

s'  nr  p'  m-1  s'  m-1'  m  —  pl  m-1' 

x  (2  )  =  x  (2  . )  +  u  T  if  x  (2  , )  +  u  T  <  x  (2  , ); 

s'  nr  s'  m-1'  m  s'  m-1'  m  p'  m-1" 


we  have 


6  «  v  T  if  x  (2  )  +  u  T  >  x  (2 

m  ms'  m-1  m—  p'  m-1 


6 

m 


=  6 


m-1 


+  (v 


u)T  if 


m 


x  (2  .) 

s'  m-1' 


+  u  T 


m 


<  x  (2  .) 

p'  m-1 


or 


6  =  v  T  if  6  ,<uT; 

m  m  m-1  —  m 


6  =6  ,-uT  +vT  if  6  ,>uT. 

m  m-1  m  m  m-1  m 


Writing  r  ,,  =  6  -  uT  we  have 

mil  m  m+1 
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Cn 

tl 

V 

T  if  r 

<  0 

m 

m  m 

fi  = 

V 

T  +  T- 

if  t*  n 

m 

m  m 

“m 

e. 

cr> 

tl 

max  Fv  T 

V  T  +  r  ] 
m  mJ 

m 

m 

40.  The  last  equation  is  extremely  reminiscent  of  the  equation  for 
waiting  time  in  a  conventional  queueing  process.  In  fact  the  distribution 
function  of  6^  can  be  easily  derived  theoretically.  The  agreement  of 

independent  calculations  of  this  theoretical  result  (in  the  steady  state) 
with  the  empirical  distribution  derived  directly  from  the  computer  program 
inspires  confidence  in  the  latter  as  a  representation  of  the  real-life  situa¬ 
tion  which  it  was  desired  to  simulate. 

41.  I  would  now  like  to  point  out  that  the  situation  I  have  described 
here  is  formally  a  rather  unusual  single  server  queueing  set-up  in  which 
arrival  and  service  intervals  are  correlated.  The  connection  was  observed 
by  Mr.  Cruon  when  a  paper  on  this  subject  was  presented  to  the  NATO 
Conference  on  Queueing  Theory  in  1965. 

42.  Denote  the  mth  piece  of  information  by  I  .  It  arrives  at  time 

X  ,  Let  us  now  interpret  I  as  a  customer  who  demands  as  service  that 
m  m 

S  be  moved  from  wherever  it  is  to  a  position  with  x  coordinate  equal  to  that 

of  P  at  time  X  .  Since  the  distance  between  P  and  S  at  time  E  is  6 

m  mm 

then  obviously  if  waiting  time  includes  time  to  complete  service,  and  since 

S  moves  with  speed  u,  the  waiting  time  of  1  is  6  /u. 

m  m 

43.  The  arrival  intervals  T  6  the  customers  I  are  by  definition 
distributed  according  to  a  negative  exponential  distribution  with  mean  r. 

If  we  say  that  service  on  1^  cannot  begin  until  S  reaches  the  position 

specified  by  I  ,  then  the  actual  service  time  of  1  is 
’  m- 1  m 

1  vT 

—  [  x  (X  )  -  x  (E  ,)]  =  —  ,  say.  Thus,  service  time  in  this  model 
u  p'  m  p‘  m-1  ‘  u  ' 

is  also  negative  exponentially  distributed  with  mean  \t  ,  where  r  =  v/u. 

44.  Writing 

(-)n-1 

n  (l-\)  (l-\2)...(l-\n) 
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we  have  for  the  steady  state  distribution  of  6 


P  [0<  6  <x]  »  1*  I  a  exp  [—  ( 1  +  ~  + 
r  -  ..  n  vr  A 

n>l 


We  have  constructed  a  table  which  shows  the  comparison  between  theory 
and  simulation  of  the  distribution  of  6  for  1000  trials.  It  can  be  seen  from 
this  table  that  the  agreement  is  satisfactory.  Consequently  one  can  have 
confidence  that  the  random  mechanisms  employed  in  the  simulation  of  the 
major  problem  are  in  fact  behaving  as  they  should.  Equally  we  have  an 
instance  of  how  an  Operational  Research  problem  in  an  apparently  completely 
unrelated  field  led,  as  a  by-product,  to  an  unusual  queueing  situation. 
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EXPERIMENT  II 


Int  =  500 

bata/l****1*11 

» ota/lan'  dzr 
!  ota/lanbdzr 

bota/lan!  dzr 
1:ota/lan!dzr 
Vota/lan’.-dzr 
bota/lambdsr 
’  ota/3 aabdzr 
beta/lanbdzr 
’•ota/3 am’  dzs 
'.ota/’ar.'  dz= 

Lota/lanl  <lzr 

Vota/la:/  dz= 
b.ota/lanbdzr 
ota/lanbdzr 
bota/lan'-dzr 
’ eta/lanbdzs 
bota/lanl  dzr 
tota/lan!  dzr 
bota/lanbdzr 
bota/lanbdzr 
1 cta/lantdzr 
bota/lanbdzr 
bota/laral  dzr 
’  cta/lar.bdzr 

’  eta/lar.'  dzr 
!  ota/lanbdz- 
ota/lan'  <lz- 
bota/lanbdzr 
'  ota/lan!  dzr 
1  ota/lan’ -dz= 

1  ota/3  ant  dzr 
'  ota/lan'  dz- 
'  cta/lamidzr 
'  ota/lan!  dzr 

’  cta/lantdzr 
'  cta/lam!  dzr 
!  ota/lan!  dzr 

!  ota/lanbdzr 
'  ota/lan!  dzr 
’  ota/lan!  dzr 
' cta/lanbdzr 
!  ota/lan!  dzr 
'  ota/lan!  dzr 

cta/lanl  dzr 
!  ota/lan’  dz- 
ota/lan!  dzr 
!  ota/lanbdzr 
;  o*'a/lan!.dz= 
’•ota/lan!  dzr 
!  ota/lanbdzr 
hota/lan’-dzr 
!:ota/lan!  dzr 
bota/lanbdzr 
'■ota/lanbdzr 
lota/laffl!  dzr 
’  ota/lan!  dzr 
’  ota/lan!  dz- 
!  ota/lan!  <lzr 


.34230956 

1  .0322601  ' 

1  ."3463C6 
1  .0340420 
."1  60044.0 
,G41 24010 
.361  02502 
1  .1020123 
1  .or  41 004 
i  .oomi 
1 ,3020331 
1  .1  2C5301 
1 .3000330 
1 .3C33 346 
','oGrr'3«— 

1  .45001  CO 
1 .4700253 
1 .0512275 
1  ,  "020333 
,41  O'"  7421 
1  .053320-0 
1  .01 701  32 
1  .1712307 
.50640076 
1  .4222012 
.45422025 
.332054GG 
.0  1570253 
.5" 6634 77 
.73050352 
1  .4632070 
1  ,e”r0,507 
;.1  '72007 

50rO6°00 

*.4  *.V.”  r1" 

1  .1106354 
.!!4C540GC 
.7641  5753 

.431 ''6730 

.41 440  500 
1  .31  011  2C 
.01130062 
.37417700 
1  .300  301  3 
1  .”01  5”  31 

'or ri  n 

'  row. 

1  .31 64032 
. 5G2C004G 
.  5!'4C"704 
1  .24”!!20: 

1  .1002205 
.50000563 
.70233160 
.62453075 
1 .0050725 
.54300473 
1  .,-irr24G 


naans  22.025329 

noanr  5.r2"2G22 
nor.nr  7.0422105 
moa ~  7.2511035 
means  34 . 701 220 
means  11  .720510 
means  2.7120050 
noanr  u .  2  0  "  1  '43 
means  5.7214632 


noanr 

noanr 

near.r 

neanr 

means 

noanr 


noanr 

noanr 

moans 
moan  r 

near.r 

noanr 

noanr 


means 

noanr 

r.oanr 


5.27040  70 
5.3625737 
G.C2CC427 
5.07521-41 
5.71  “1245 
7.  "■'’04222 
2.1  537321 
5.0225714 
4.-1  23225 
4.4034045 
10.275733 
7.1134427 
7.373320C 
0.3044307 
1  :  .  31  3200 
5.0552401 
:  16.577750 
:  22. 70  721  3 
11.  047044 
■  12. "07000 
:  0 .3515727 
:  2.1  "C7452 

-  0.0004544 

-  3. ""01  044 
:  1 ’."40350 

-  10.60275' 

-  1  .103040 

-  r  ""201  4" 
r  30 .3570-41 


c  rr'i'i'inr 

1  7. '77002 
1  0.1ti>p45 
5.761  7tC0 

o  .0522205 

.1  4422T 

7.3460770 
IT .005533 
2.3103425 
5 .600.0263 
1 n .375260 
10.343171 
0 ."35CC05 
6.2:772342 
10.777621 
1". 605435 
12.0441  50 
7.4550BC3 
13.0S7733 
=  4.6660,053 


TABLE  1 


EXPEKIMEHT  II 

beta/laabdzr 

bota/laabdzr 

bota/lanbdzr 

bota/lambdzr 

bota/lanbdz= 

bota/lani  dz= 

1:eta/lambdz= 

bota/lanbdzr 

bota/lanbdz--- 

beta/lanbdzr 

bota/lambdz= 

bota/lambdzr 

bota/lanbdzr 

bota/lambdzr 

bota/lacLdzr 

bota/lambdzr 

beta/1 ambdzr 

bota/lambdzr 

bota/lambdzr 

bota/laral.dzr 

bota/lambdzr 

bota/lanbdzr 

bota/lambdzr 

bota/lanbdzr 

bota/lanbdzr 

fcota/lanbdz= 

bota/lanbdzr 

bota/lanbdzr 

bota/lanbdzr 

hota/lanl.dzr 

bota/lanbdzr 

bota/lambdzr 

bota/lanbdzr 

bota/lanbdzr 

bota/lanbdzr 

bota/lanbdzr 

'■ota/lanbdzr 


(b) 


,38392241 

,  BO  (190907 
."3074320 
1  .3062971 
.42771174 
.24335117 
.32336100 
.00427377 
.01  .  24RCC 
.73361413 
.61332436 
1  .1 31  3C4G 
.42163116 
1  .3234232 
.73173092 
1 .1763334 
1  .1503356 
1 .1409923 
1  .25'.  7CG5 
1 .4320436 
1 .5370753 
. SG4o21  jw 
.65337223 
.7331 3244 
1 .4762043 
.72325336 
.73225341 

n  JO'tOO'' 

, 675321 22 
1  .2423334 
1 .3535033 
.9321 7336 
.44223312 
1 .2722371 
1  .4943932 
.73404610 
.37233729 


p  meanr 

r  meanr 
->  meanr 

-  mean= 
meanr 

•'  meanr 
p  mtn= 
meanr 
i  noan= 

;  meanr 

-  moanr 
p  meanr 
p  meanr 

meanr 

■  meanr 
p  meanr 
r.  meanr 

meanr 
•  near.r 
;i  neanr 
p  noanr 
p  neanr 
p  oeanr 
p  neanr 
p  moanr 
p  neanr 
p  moanr 

■  noanr 
••  meanr 
p  neanr 
-*  neanr: 
'  neanr 
p  noan= 
•-  noanr 

moanr 
■  noan= 
■■  moar.r 


19.831528 

9.3080638 

2.0625514 

3.7273552 

17.615500 

31  ."06223 

f  ,"60r729 

f  .3933030 

IT.  133727 

2.5264612 

12.156744 

6 .61 21  422 

17.370403 

5.3461 723 

2.2961053 

6 .2064650 

6.5091562 

C.555C277 

5.9409340 

5.91  70 797 

4.2534430 

9.2352720 

11.501221 

19.223607 

5.  ■'61  2533 

1'. 349537 

ir .203222 

3.3409.467 

11  .133325 

6.0225453 

5.5254236 

9.0500009 

1  2.779254 

3. 5050509 

4.9922200 

9.59.23141 

12.521136 


bota/lanbdzr 

.95579134 

noanr  7.9;43C917 

bota/lambdzr 

.31001430 

neanr  C .2721 324 

bota/lambdzr 

1  ,ro6nC24 

;•  noanr  7. "222434 

1 ota/lami  dzr 

1  . 31  34„4l 

-  neanr  5.0723623 

bota/lanbdzr 

.43213244 

neanr  17.43205G 

beta/lambdzr 

.34069742 

-  neanr  2.93255*1 

1-ota/Iambdzr 

1  .1423622 

v  neanr  C. 554945" 

I.  ota/lam’ .dz= 

.51252023 

-  noanr  1  1.432721 

bota/lanbdzr 

1  .720051  7 

noanr  4,1270791 

bota/lanbdzr 

.CO 573 643 

r  neanr  10.9.n621C 

bota/lanbdzr 

.62102464 

-  noanr  11.  ■''26297 

bota/lanbdzr 

. 402GG0 G2 

■  noanr  1 -.705343 

bota/lam!  dzr 

.93226711 

p  noan=  7.C23G724 

lota/lanl.dzr 

1  .1245514 

p  moanr  G. 7341 767 

bota/lanbdzr 

.93224521 

•  meanr  7.756C439 

bota/lambdsr 

1 .4554341 

l  neanr  5.1343322 

bota/lanbdzr 

9.  "31 7961 

■  noanr  3.6967150 

bota/lanbdzr 

1 .5701221 

-  meanr  4.7547402 

bota/lanbdzr 

.31 GC7C25 

p  noanr  23.720427 

bota/lambdzr 

.71664559 

p  noanr  19.432174 

beta/lanbdzr 

1 .91 C"41 1 

P  noanr  3.3815304 

bota/lanbdzr 

.73095111 

meanr  9.G20527G 

bota/lamldzr 

.92550630 

neanr  9.1 '■25221 

bota/lanbdzr 

.54056263 

p  neanr  16,925436 

bota/lambdzr 

1  .1219212 

•  moanr  6.G79"977 

TABLE  1  (continued) 
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Best  Available  Copy 


EXPERIMENT  II 

bet*/l»«bdr- 

’■ota/lan!  dzr 
l:ota/Ian»dzr 
:  otn/lam'dz= 
t-cta/lant  dzr 
! cta/lanbdzr 
’.  ota/lan’.  dzr 
!  ota/lan.ldzr 
hota/lanbdzr 
1  cta/lan!  dzr 
lota/Ja!ni'dz= 

1 ota/lambdzr 
bota/lon!  dzr 
'  ota/lam!. dzr 
'.•ota/lanldzr 
1 ota/lanbdzr 
hota/lanbdzr 
Lcta/lanbdz- 
)■  cta/lan)  dzr 
!  ota/' an)  dzr 
!  ota/ianbdzr 
1  ctn/lar.)  dzr 

’.ota/Iar'>dz- 
)  Citr./lar.)  dzr 


2.5662482 

i  .i  377333 
.rr-r7ci:' 
1  .1  734614 
i  .i  -rr?i7 
r.*:?35i  75 
.54333(335 

,or  n  doer 

."•1  G61CC2 
.4303GGE3 
1  ."C51H". 
.74747333 
..*■31  77414 
.5:473453. 
1.".*1  0477 
.GOr  52C3? 

•  •  I 

.  ""401  or.  5 
i .  r>:  tori  r. 
i .  nr  oor 

.“CoVI  "0 

i  .rrvif-ir* 

.'i55G5rr 


p  mean 

*  mean 
moan 

-  r.o an 
•'  roar. 

*  noan 
v  moan 
p  non  r. 
p  noan 

non.  r. 

*  moan 
;  mean 
p  mean 

-  rear. 

-  near 

*  noan 
•i  roar. 

r:oa~* 

:  :.iean 

*  near 
•••  near- 


2.8859421 

r.sr-rcci 

11  .153395 
C.4C-453C2 
r.24  53n 
2. 324 ''•CO  7 
1  5. f 52527 
2r.:-7r'2S7 
r.l  377551 
17.5CC51C 
7.4GCC715 
K .954231 
11 .3C23G3 
13.3CCSGC 
.-.'321733 
1 2.343320 
4"30471 

1—  r-A  -»  f 

4. '57*753 
7.1  IT  "75 
'  .773'- 1  53 
i  .*:''J73 
7.3341  TT.r 
~.1"7*OC7 


)  ota/lan)  dzr 

1  .'T>4Gr<<-' 

••  near- 

7.1*34437 

1  Lc./lar.'  <lr= 

1  -rr.7 

rear.  - 

-  •*  i 

1  ota/1  an’  dzr 

r  -J  •>  *7^  r.  ** 

roan  - 

-  .r  n  ;i 

’  ot .-./'  an’  dzr 

*• •  r  tm-mr-  r\ 

r.c  r.:r 

.  -.y- 

1  ota/l  an’  dz~ 

1  .*'•15735 

near.- 

! .  "C47337 

)  ota/lanldzr 

1  .51  DOC  11 

;•  means 

4.3 377743 

)  cta/lan’:  dzr 

1  .1  *5414 

•>  near- 

*.3433141 

1  ota/lar.)  dzr 

.T  4  7  3774 

;  r.ea-  : 

1  -  "or  »  ^ 

•  •  •  *■*  - 

:  ota/lan>dzr 

.7T?34r7. 

•  near,  t 

,7T  3.':; 

■  eta/lard  ;lzr 

1  .?::'75334 

near.  - 

7.*  :i  :*77 

beta /Ian"  dvr 

*-#•*•  r\  *•  A  r  * 

•  ’•  -*  • 

men 

:  '.3' 57-1 

ota/lar.’  dz- 

1  .*1  "7.-37 

nor.”  - 

7.  T  '31. 1 

ota/3  an'  dzr 

.  4  ‘  31  l.T  1 

•  near.  - 

17.431747 

ota/lanbdzr 

i  .i  -r-:r 

near- 

2. 252  2242 

cta/lan’  r,~~ 

1  .' "••'•775 

-»  roar-- 

*.1  7.:3735 

ota. Tan'  i’s- 

1  .1  '*7733 

no  a  s'.-: 

•.'11**37 

c..a  '1  an',  dzr 

.*:  1 3". 71  • 

;  near.* 

'  .1  44371  3 

o  ca  'Tan’-dvr 

.  .17  I  T 

r.c  an- 

tv:  :i  4577 

to. a  'Ian)  ,!z- 

..-44  ’771  3 

re  a:. 

11  .7331' 3 

•eta  Tan'  dz- 

1  .1  T“7D  2 

rear. : 

:.M  5333 

:.ci.a/!an'  dr.- 

:  ..  T’T1.‘ 

rear.  - 

•  w.  w T2 7 

eta. Tan)  dzr 

i  .  3  745 

means 

3 . 1  5733  74 

'  ota/3  an),  dzr 

1  ..1774'*': 

noan:- 

■' .  0-  -Vvi- 

'.ota/lard  dzr 

.75 "37"  4 

nor.  a- 

11  .77.  *731 

'cfa/lan’dzr 

...  1 ' 31  T 

•  rear. 

.3 . 37.477-7 

1  ota/lan’. dzr 

.543""  .’.7 

r.tr.v.- 

1  .'7431 : 

)  ota/lar'  dzr 

.4 

noar.r 

13.  ..**33* 

’  ota/lan!  dzr 

.771  337G.1 

p  mean* 

1*..  *31744 

'■  ova /lari’  dzr 

.*7"  34  "47- 

'  near.- 

*.3*7135* 

'  -.ta/lanl  dzr 

1.  *7'  5*5 

■  roars 

3.:  '73.*ar 

'.ota/lard  dvr 

1  ."54"T- 

near.:: 

1,1  J  It  O. 

i  ota/lan)  dzr 

.7*4*4' 71 

noan- 

1.3.433:3:: 

)  ota/lan'  dzr 

1  .'I  37"77 

moan  r 

7,3-43333 

’  ota/lar.v  dzr 

.433711  !* 

noan= 

11.P*  33.,' C 

’  ota/lan’.  dzr 

7 

noan  s 

.  5....W.. . 

TABLE  I  (continued) 


Best  Availab  e  Copy 


EXPERIMENT  II 
b«ta/la*bdz= 

'  eta/lanbdz- 
’■cta/lan:.'dz= 
l’ota/1au:;dr.= 

lota/lan’  dz= 

•  cta/lar/  dr.r 
!  oi.a/Iati'  •«!?.= 
bota/lan'  dz= 

1  ota/lar-.’.. dr.= 
V.ota/lambdzr 
Veta/lan!  dzt 
l  ota/1  an!  dz  - 
bota/lars’..dz= 

!  ota/1  au'odz  = 
liota/lan~da= 
hota/laiabdar 
!  ota/lanl.dzr 
’.ota/i  ar.:;dz= 
’•ota/i  an!  dz~ 
i  cta/1an'..dz= 
lota/1  an!  dz= 
bota/'lr.n’.tl::- 
i  cta/l.in’.:d7.= 
’•cta/1  vV>  dz- 
!  cta/'l  an;  dr.= 
1-cta/1nni.dz  = 
:.cVa/1ar.;.d7.= 
bota/lar.’.  t'.7- 
'  otr./lan;  dr  = 
’•eta /’. ar.!  dzr 
I  cta/lan'  r!z  = 

’  cta/lan':;Iz= 

’  c  '  a/1  nr  drr 
'  oto/1  an'»dr- 
'  ota/Ian!  dzrr 
ota/Ian’  dr.= 
’■ota/iar.,'!z: 

'  ota/i  an’ 

'  ota/'an’  dz  = 

1  ota/i  an!  dr.T 
!  ota/lan!.dz= 
i cta/lar'  dz= 

’  ota/Ian!  dz  = 
cta/lari’  dz= 
-.cta/lar.’ 

ota/Ian’  >\z~ 

'  eta /lanbdzv 
!  ota/1  anl.dz- 
l.ota/lan’>d7.= 

"  •  c  t  a  / 1  a  n! .  d  z — 
!.cta/1a:i'  d7.= 

!  ota/Ian!. dz= 

!  otr.  Tati’.dzr 
1  ota/lar.’  ■ 

! ota/lar’  dz  = 
’  ota/lar.;!  dz= 
I  ota/]an’.dz= 
’  ota/1  am’  dz  = 
’..ota/lan’dzr 
!  ota/lara!.dz= 
’  eta/1  anl.dz  = 


.14126939 
.1747^71  f. 

i  .-n  !'-7-:i 
.4372435- 
.7n.':f4'-30 
123374 
1  .137724.7 
.01 337400 
.  57.5-.7000 
.  .’33031 1r- 
.r,"70J.’-r;c 
1 .”21.7330 
.G3f;3G-lf'3 
1 .1400376 

1  .:.E21  •■’20 
1 .3 07000 2 
.52123130 
.733; 4732 
.01 1 00337 
1  ..-111C7C 
.7  7.TC251 
1  .  1 1  31  C4 
.3-13330' 3 
,  53G1-  7.1.> 

1  .1002771 
,  <  *  * 

.•  •■3'.:r 
1  .-1171  21. 
.n-  33243-1 
..*•*1  35327 
.!  3371  330 
n  i7o 
i  .i  io'-o' 
.2271  "1  37. 

1  .'.1324;  1 
.  52071  ■'  7.. 

1  .2277'.  SC 
1  .3270: 12 
i  ."2o::  r 
.'-2225535 
1  . 4501 1  27 
1  .71 00202 
.03'T.O773 
.0737732. 
i  .i2r- 
.'2434317 
2.'  134733 
1  .  :0273'!3 
.  2,r  1  04-1 02 
.4"1  7243; 
.r.<-321  1  ’4 
1 .3043222 
1 .-245710 
.21  !  3C500 
1  .1  2.70502 
1 .”700021 
1 .3574427 
.55205413 
.321  03271 
.50043132 


p  aaan= 


p  ucar.= 


53.454910 

42-.1 27C35 
5.21042'  ' 
I-.!  30177 
7 .2573243 
7 . 50 j~77r 
0 .  '1777252 
1  _.105G30 
14.  .’.121-0 
'•.'•4C  COO” 

1  .444!  54 
7.3371 255 
1 1 .774204 
C.331  412 
3. 0.. 17242 
2.1-701  22 
0.7.2  72  703 
1  1.4441 53 
1-. 142502 
i:.:~03CC. 
4.11 47442 
1 ;  .  .4-32  - 
-0.1  .11325 
:  21  .070730 
=  I!’.  '14230 
r  0.7727427 

r  :  122.-2 

r  •  .12. 7042 

=  7C." 34231 
=  17.7:411'' 
r  7 . 2321 '32 

-  1-  ..-  7271 

=  7 ."77472' 
=  7.30457.73 

-  14.'  274:: 
=  5.  1  5112 
7z  4. '  7 13747 
=  7.1 31  521  7 
=  11.3!  5351 

-  5.1  312 772 
r  ".ir.-Mrcr 
=  11 .47  702 
_  1.  .212212 

-  7.  IT  541 

=  7.1143350 

-  7 .3554355 
=  7  .'’3371  37 

-  1 5.01 7407 

-  1  .'.771702 
i=  1.7.705734 
=  5.7352057 

7.7127117 
i=  34.371717 
i—  0.002'’ 474 
;=  7.-04271  7 
i=  5.5251  701 
;r  13.034343 
ir  23.441477 
i-  14.772322 


TABLE  1  (continued) 


Best  AvailabV 


Best  Available  Copy 


EXPERIMENT  II 


<o 


beU/luMt:  .30063233  P  •**•»=  25.087059 

beta/lambdzs  .27749743  ~  means  5.5554312 

'  ota/lan1id*=  .65540310  r  *»»"=  16-.'C3CS7 

!  ota/la:V:  tlzr  .nsSCIS'l  moans  10.53*763 

'*  tota/lan' <Iz=  .5725521'’  ■  neann  2,671771  0 

bota/lam' <lz=  .25440432  ■'  moans  01  .70029G 

'■  ota/lnr.-’  dz=  .7112010  ~  moans  1 162772 

V cta/lara’  dz=  .:  2700245  r  moans  7.711  :.'05 

?•  1  ota/Iambilzs  .62702234  noanr  11. ’70 732 

1  eta/lan’.-dzs  moans  1'  .'26025 

>  ol a /law!  tl7-  1.2407123  '  aoans  7.: ''42247 

;  ota/TarAilns  .'720.2  0.1'  ••  moan-  7.72::1773 

%. .  «  ota/larf  t’.zs  1  .""""I ?>  -  noai\=  7.4223020 

1-ota/larv  <1s=  1  .7-2"4" 1  noan-r  4 .1  *"1 332 

'<  c'  Tv/lanl.dzr  .0112-4015  -  moans  1  .3-01 20 

■  !  otu/?a.v  d==  1  .-'7—522  noans  C.  377750 

5 .eta/' at»  ,!«=  1  .OO-O.-OO.  ;  noanr  5.3434343 

Uala.-’anl-dzs  2.7545313  •  means  2..C045553 

f  :  »ta/lau:  U==  1.51260'’*  moans  4.0272522 

•  a 'lar:  <17=  .""6 71 moans  1  .  "25435 


pta/Jau’  :!:■ 

.  7'M  33.. 31 

moan- 

r.71  3107 

o*;a  '.tai.;’  -Is-; 

.0,0204'  70 

mean- 

10. *4 3- 75 

<;C -./lar.’.1  -!rs 

1  .'-3211  "5 

meant 

.*.  "IC'7: 

•ota  -'Ian'  <lz= 

,  542 "01 1 

noanr 

1  '.7  12220 

ot.i.'*  a:  /  d?.s 

1  .11  "2044 

P 

meant: 

0. 7"1 7730 

vta/lanbdzs 

.240  640  "1 

noanr 

ctr.  'i a: 0  <lz= 

.  5241  2741 

moan- 

1  0. -3224.3 

ota  la:;'  <!z= 

.670.20  "4* 

•1 

noanr 

11 .‘73713 

««ts  ‘l ar.'  <!z= 

."2  330>2"  r, 

noant 

n .  :  r 

<•*. a'1  a:..'  tins 

1  . 1  71 1  36.1 

rear.:.* 

.2  2172" 

t-‘  a  '’an:  dzs 

."632"  "3 

P 

noanr 

r  .2'  70471 

<  *  a  /l  a;  •'  Oj." 
or.-.  /latlVdzs 

1 . 332421 3 

- 

meant 

3. 01  2231  0 

.5:42,:.C3C 

near.  - 

1  ‘ ,  363-140 

^  ^  /*  n  .  ,i.  .|—  _ ■ 

.* 

r*.oar. 

0  •  .*  Z7TC7 

o'.-,  'lav'  4zs* 

1  . "27707" 

near.- 

J  • 

nta/lai-J.  dz= 

.63023734 

noanr 

1 1.  ."."30,43 

oi-  'la:  '  .Izs 

, '  07“1  "1  '• 

:.ioa:'.t 

-.  .’7070  0 

otr.  'lav.'  :'z - 

1  ."1  10773 

moan- 

•2.2  001  40 

ota  '’a-r  dzs 

.77111  172 

noa:.-- 

' .7446132 

-.  a  'la::'  'a- 

1  ."'017-0 

r.oar.- 

.0.  O' 7*71 3 

'.'a /lav'  !z— 

.7:  -3 '  "1 

nor.".  • 

■  ,-t  :r.:'rc" 

i ;  a  / 1  a  I.;:  :1a  s 

.  j**rc'  r.r.- 

roar.-: 

1-:.:  41 270 

a /"  a:..'  :1a- 

c:: y f*  "n*! " 

mean  - 

11  .  '40 "77 

nta/lam!  Jz: 

-70711." 

r.oa  iv 

1  "  .  '.'-7  700 

o  a  ''lar.il  dzs 

1  .  r  ".  2; 

men  - 

1.  '37' 10 

■  ■  a  4a.  s 

.  -  1 .. 

0  .1  -2330 '■ 

r.  .'lari'  <Ia- 

.-1"31 

;.:oa  ' ~ 

1 4.745301 

a/lar.'  dz= 

1  ."1"  "'14 

■1 

non  - 

O  ,  i  .  »  1  -U- 

'1  -in'  (|7.r 

.'■3!  0770" 

nsoaat 

•  0.2-1030 

'  ••oa/laul  dr.r 

.036.33633 

near.  01  .1 34755 

'  i-t  a  'lav  dzs 

1  .:'7-'i"ri 

cicaa=  3.437r'172 

c  ’  -  '1  an'  ilr.r 

.*  43624  "7 

cc.r.r  7.r2C4560 

'  cl  a/lar.:  <lzs 

.  20  7  2  - 1  -  *  2 

roar.-.-  1  ',V'f'0C7 

'  c.  a  .'lav'  <Iz= 

'  3. 3  50 

r.ca  v.  -  0 . 511  1 1  34 

i  r.r-\/i  an*  dzs 

1  .'71 3":"3 

nca.i.-  2.77407G2 

i  cta/Ia:2:dz= 

.’77777.3 

:  roarv:  ‘  075025 

1  ."7"  121 

;r  1:10a •'!-  7,3-150  570 

'  c!  a/lan!  dzs 

1  .77020.1  5 

:  r.oar: -  4.0" 40310 

!.ota/lanl.dz  = 

1  .3:  34440 
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EXPERIMENT  V 


COMPUTATIONAL  CONSIDERATIONS  IN 
MULTIPLE  LINEAR  REGRESSION 


Harold  J.  Breaux 

U.  S.  Army  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland 

INTRODUCTION 

The  statistical  theory  concerned  with  multiple  linear  regression 
and  simple,  partial  and  multiple  correlation  is  highly  developed  and  has 
been  one  of  the  most  useful  tools  of  analysis  provided  by  statistics.  The 
widespread  availability  of  modern  high  speed  computing  machinery 
makes  practical  the  solution  of  many  regression  problems  which  before¬ 
hand  might  not  have  been  attempted  due  to  the  inherent  computational 
difficulties.  High  speed  computing  machinery  enhances  the  value  of 
multiple  linear  regression  by  removing  the  computational  drudgery 
and  making  possible  more  sophisticated  procedures  of  analysis.  Despite 
the  tremendous  speed  and  computing  capabilities  of  modern  computers, 
much  can  be  gained  by  the  skillful  design  of  computer  programs  designed 
to  solve  the  normal  equations  and  provide  the  associated  statistical 
data  for  estimating  significance  of  variables  and  prediction  intervals. 

The  computational  labor  associated  with  multiple  linear  regression 
arises  in  the  formation  and  solution  of  the  normal  equations.  Efficient 
algorithms  for  solving  the  normal  equations  are  described  in  the 
commonly  used  texts  of  statistics  and  numerical  analysis,  however, 
only  recently  has  any  widespread  effort  been  made  to  fully  take  advant¬ 
age  of  the  capabilities  of  computers  for  doing  "exploratory"  type 
regression  computations.  In  problems  where  many  variables  are 
involved  the  analyst  may  have  only  intuitive  suspicion  regarding  those 
variables  which  are  significant.  When  this  is  true  it  is  desirable  to 
define  a  "candidate"  linear  model  which  includes  all  the  variables 
which  are  conceivably  significant.  The  exploratory  experiment  then 
would  consist  of  entering  this  candidate  model  and. the  appropriate 
available  data  to  a  computer  program  specifically  designed  to  analyse 
this  model,  and  output  a  reduced  model  containing  only  significant 
variables. 

One  way  to  design  such  a  program  is  to  have  it  obtain  the  solution 
to  all  the  "suD-set"  model?  that  can  be  formed  from  the  collection 
of  variables  in  the  candidate  and  choose  the  one  which  best  meets  the 
significance  criteria. 
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If  this  model  contains  N  variables  there  are  2 "  -1  sub-set  models. 

This  method  is  made  practical  for  as  many  as  20  variables  by  a 
"binary  algorithm"  decribed  by  Lotto  [  1  j »  196 1,  and  Garside  [2], 

1965.  This  binary  algorithm  defines  the  optimum  path  of  elimination 
so  that  the  Gauss-Jordan  algorithm  goes  through  the  fewest  recursions 

N 

when  generating  the  2  -1  solutions.  The  method  has  the  advantage  of 

being  always  able  to  identify  the  "optimum  model".  For  the  purpose 
of  this  paper  the  optimum  model  is  defined  as  that  model  containing 
only  variables  which  are  statistically  significant  at  a  chosen  level  of 
significance  and  which  has  the  minimum  variance  of  residuals  among 
the  sub-models  that  have  all  terms  significant  at  that  level. 

The  scope  of  some  regression  problems  is  such,  however,  that 
more  than  twenty  variables  are  required  in  the  candidate  model.  Such 
a  problem  is  one  described  by  the  author  in  BRL  Report  No.  1348*, 

"The  Computation  of  Firing  Tables  for  Guided.  Missiles",  [3].  In  this 
problem  it  is  desirable  to  define  a  candidate  model  containing  100  or 
more  terms.  A  very  practical  solution  was  obtained  using  "Stepwise 
Multiple  Linear  Regression".  The  program  was  patterned  after  the 
computational  scheme  described  by  M.  A.  Efroymson  [4]  and  is  docu¬ 
mented  in  BRL  Report  No.  1330  [  5],  For  documentation  of  similar 
type  programs  see  References  [6],  [  7]  and  [8]. 

Stepwise  Multiple  Regression  takes  advantage  of  the  fact  that  the 
Gauss-Jordan  algorithm,  when  used  to  solve  the  normal  equations  with 
N  variables,  yields  intermediate  solutions  to  N  regression  problems 
containing  respectively  1,2,  ...  and  N  variables.  The  procedure 
advances  in  stages.  In  the  "forward"  version  the  variable  which  enters 
into  the  regression  is  the  one  which  at  that  stage  results  in  the  greatest 
reduction  in  the  sum  of  squares  of  residuals.  The  power  of  the  procedure 
is  further  enhanced  by  removing  variables  at  later  stages  that  may  have 
become  insignificant.  The  decision  to  add  or  remove  variables  is  made 
by  use  of  "t"  or  "f"  tests  of  significance.  The  procedure  advances  until 
an  equilbrium  point  is  reached  where  no  significant  reduction  in  the  sum 
of  squares  of  residuals  is  to  be  gained  by  adding  variables  into  the 
regression  and  where  a  significant  increase  arises  if  a  variable  is 
removed.  The  "backward"  version  of  the  procedure  begins  with  all 
variables  in  regression  and  proceeds  in  the  opposite  direction  to  achieve 
the  equilibrium  stage.  The  relative  advantage  and  disadvantages  of  the 
two  procedures  is  dependent  upon  the  application  however,  it  seems 
desirable  for  a  well  designed  computer  program  to  contain  a  capability 
for  noth. 


* 

Copies  of  this  report  are  available  to  qualified  requestors. 
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MATHEMATICAL  BASIS  OF  THE  STEPWTSTT 
REGRESSION 


The  mathematical  basis  of  the  stepwise  regression  is  that  the 
transformation  rules  of  the  Gauss-, Tordan  algorithm  correspond  to 
recurrence  relations  that  exist  between  covariances  of  residuals, 
regression  coefficients,  and  inverse  elements  of  partitions  of  the  covari¬ 
ance  matrix.  These  relations  are  conveniently  expressed  by  taking 
advantage  of  Yule's  notation  fl].  In  this  notation  the  regression  equation 
is  written  in  the  form 
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nl,  23.  •  •  n  — . 


X 


n2.  1 3.  . .  n  -  1 


X. 


+ 


b 


n, n  -1.  12a  a  . n 


-2 


X 

n  -i 


(1) 


The  first  subscript  of  b  is  that  corresponding  to  the  dependent  variable 
X  ,  the  second  subscript  corresponds  to  the  independent  variable 
attached  to  the  regression  coefficient.  These  two  subscriuts  are  called 
the  primary  subscripts.  The  remaining  subscripts  on  the  right  of  the 
period  are  those  of  the  remaining  independent  variables  and  are  called 
secondary  subscripts.  For  a  particular  observation  equation  (1)  takes 
the  form 


X  . 
jn 


X 


jl 


+  b. 


J2 


+  b 


n  -1 


X  . 
J. 


n  -1 


e. 

J 


(2) 


e  .  is  a  residual  and  is  the  difference  between  the  predicted  value  and  the 

J  * 

observed  value  of  X  .In  Yule's  notation  the  residuals  are  denoted  as 
n 

Xn  _  j2.  , .  n  “1  ’  Since  regressions  containing  fewer  than  the  (n-1) 

independent  variables  are  of  interest  it  is  convenient  to  introduce  the 
notation 

q  =  1,  2,  a.  a  (i“l),  (k+  1),  .  a  ,  p  (3) 

Note  that  q  is  a  set  of  subscripts  containing  the  digits  1  through  p,  excluding 
i,  j,  and  k.  Furthermore  q  is  a  sub-set  of  the  (n-1)  subscripts  of  thte 
independent  variables. 


It-  should  be  noted  that  the  variables  X.  are  assumed  to  he  measured 
without  error. 
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The  covariance  of  the  variables  X.  and  X.  is  defined  as 

i  J 


where  f  is  the  degrees  of  freedom  and  the  summation  extends  over  the 
m  data  points.  Any  variable  can  be  considered  as  the  dependent  variable 
e.  g.  ,  the  residuals  X,  and  X.  will  be  of  interest.  The  covariance 

8  i.q  j.q 

of  residuals  is  defined  as 

s..  =  Yx.  X.  /i 

ij.q  L  1.  q  j.q' 


Using  the  above  notation,  the  normal  equations  can  be  written  in  the 
form 

. . .  n-1  (4) 

n-1  nk' 

k  =  1,  2 . n-1  (5) 


L  ^n.  12  ...  n-1 

xk  -  0,  k 

*  1,  2, 

or  equivalently 

8lk  V+-  8 2k 

b2  +  + 

Sn-i* 

The  complete  covariance  matrix  is 

'll 

.  8 1 2 

■  •  •  s  . 

In 

“21 

S  * 

S22 

•  »  •  8, 
2n 

"nl 

Sn2 

•  t  •  8 

nn 

(6) 


This  matrix  corresponds  to  the  augmented  matrix  of  coefficients 

usually  considered  in  solving  a  system  of  linear  equations  with  the 

addition  of  the  nth  row.  The  nth  row  is  added  so  that  the  variance  of 

residuals,  s  will  be  made  available  through  matrix  manipulations, 
nn.  q 

thus  avoiding  the  need  for  computing  residuals  at  each  stage. 


The  matrix  element  X,.  ...  is  defined  as  the  ij'  th  element  of 

ij.q  ij  k 
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the  inverse  of  the  partition  of  the  covariance  matrix  formed  by  taking 
all  the  rows  and  columns  nf  indices  q,  i,  j,  k. 


The  recurrence  relations  between  the  b's,  c's  and  s's  that  are 
of  interest  in  stepwise  multiple  regression  are  tabulated  in  Table  1. 

The  solution  of  the  normal  equations  by  the  Gauss-Jordan 
algorithm  is  equivalent  to  the  successive  application  of  linear  trans¬ 
formations  to  transformed  matrices,  the  initial  matrix  being  the 
covariance  matrix.  The  successive  matrices  that  are  generated  by 
the  recursive  equations  can  be  denoted  as  Aq,  A^,  ...  An_j. 

(k  =  1,  2,  ...  n-l)  is  the  matrix  formed  by  applying  the  transform¬ 
ation. 


k 

ij 

k-1 

a  . .  -  a 

k~l  k-1/  k-1 

ik  a  kj  '  a  kk  * 

M* 

ii  n 

1,2,  ....  (k-1)  (k+l).  . ,  ,  n 
1,2,  ...,(k-l)  (k+l),.,,n 

H-  X4 

II 

k-1  , 
'  aik  ' 

k-1 

akk 

i  * 

1,2,  . , . ,  (k—  1 )  (k+l)...,n 

k 

k-1  , 

akj  f  a 

k-1 

(7) 

-J  = 

kk 

j  = 

1,2,  » «  > ,  (k— 1)  (k+l). . , ,  n 

k 

kk  = 

i  k-1 

akk 

i  = 

j  =  k 

to  the  matrix  A^_j.  This  transformation  is  denoted  as  T^.  The 

superscripts  denote  the  fact  that  the  matrix  being  operated 

on  to  yield  A,  .  The  sequence  corresponds  to  the  introduction  of  the 

variables  into  the  regression  in  the  order  l,2,...n-l.  In  general  the 
sequence  would  be  different,  however,  no  loss  of  generality  arises, 
since  one  can  renumber  the  variables  in  any  arbitrary  fashion.  By 
use  of  the  recurrence  formulas  one  can  prove  the  following  theorem: 
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table  1 

RECURRENCE  FORMULAS 


1. 

Cij.  qi  jk 

= 

**  *  •  .  .  *"  b  .  f  d  /  q 

ki.qj  kj,  qi'  kk.  qi  i 

2. 

( 2 

ik.  qi  jk 

= 

~b  ,/s 

ki.  qj  kk.  qij 

3. 

b..  u 

Ji.  qk 

- 

b..  -b  S  /a 

Ji.q  ki.q  kj .  qi'  kk.  qi 

4. 

Ckj.  qi  jk 

= 

d,  .  ,/B 

kj.  qi  kk.  qi  j 

5. 

c 

kk.  qi  jk 

= 

^Skk,  qi  j 

6. 

bjk.  q 

a 

s,  .  /s, 

kj.  q  kk.  q 

7. 

dij.  qk 

3 

d  -h  / 

ij.q  kj.q  8ik.qj'Skk.qj 

8. 

dik.  q 

s 

Sik.  q/  Skk.  q 

0. 

Sij.qk 

3 

ij.q  8  Ik.  q  8kj,  q^8kk.  q 

10. 

CU.qi  j 

3 

Cij.qi  jk  “Cik.  qi  jk  Cjk,  qi  jk^kk, 

11. 

bkl.qj 

B 

—  c  /  r 

ki.  qj'  'kk.  qi  jk 

12. 

bjl.  q 

3 

bji.  qk  Cik.  qi  bjk.  q/Ckk.  qi  k 

13. 

dkj.  qi 

3 

c  /  c 

kj.  qi  jk'  kk.  qi  jk 

14. 

8kk. qi  j 

= 

i/c,,  . 

kk. qi jk 

15. 

s,  ■ 
kj.q 

3 

b.  /c 

jk. q  kk. qk 

16, 

d.. 
lj.  q 

= 

d.,  — d,.  c  /c 

lj.  q  ik.qj  jk  ;jk'  kk,  qjk 

17. 

Sik.q 

3 

^i.  q^Ckk.  qk 

18. 

a . . 
ij.  q 

= 

s,.  ,  _d.,  b  /c 

lj.qk  ik.q  jk.  kk.  qk 

THEOREM:  * 


The  matrix  A^,  defined  above,  contains 
partitions  having  elements  as  follows: 

four  partitions,  the  respect- 

a.  .  = 

i  J 

°i  i.  12. . .  k* 

1,  2,  •  •  •  kj  j  —  1 ,  2 ,  •  •  •  k 

a.  .  = 

i  J 

b.  .  .  .  i  = 

j  i#  1 2.  .  •  i~l,  i"^l »  » «  k 

1 ,  2,  i  *  i  k|  j  “  k+ 1 ,  kt  2,  *  • «  n 

(8) 

a.  ,  = 

i  J 

di  j.  12. . -  i— 1 ,  i+1. . .  k*  1 

kt  1,  k+2,  set  n,  j  *  1,  2,  •  •  •  n 

a.  ,  = 

i  J 

-  1  I  1  = 

i  j«  1 2*  •  •  k 

k+1,  k+2, .  • .  n,  j  »  k+l,k+2,  .. 

The  consequence  of  the  above  theorem  can  be  generalized  as  follows: 

The  collection  of  variables  whose  subscripts  are  represented  by  the 

values  taken  by  k  in  the  successive  application  of  T  are  said  to  be  in 

K 

regression  if  k  appears  an  odd  number  of  times  in  the  collection.  Alter¬ 
natively,  a  variable  is  said  not  to  be  in  regression  if  its  subscript  does 
not  appear  in  the  collection,  or  if  it  appears  an  even  number  of  times. 

If  the  subscript  appears  twice,  e.  g. ,  the  corresponding  variable  was 
entered  into  the  regression  and  then  removed.  The  nine  recurrence 
formulas,  10.  through  18.  can  be  used  to  prove  that  the  application  of 
the  transformation  T^  to  A^  generates  the  matrix  A^^,  i.  e. ,  the  variable 

is  removed  from  regression  by  the  same  algorithm  with  which  it  is 
entered. 


The  derivation  of  the  eighteen  recurrence  formulas  and  the  proof  of 
this  theorem  are  contained  in  the  author's  Masters'  Thesis,  soon  to  be 
presented  to  the  Graduate  School,  Department  of  Statistics  and  Computer 
Science,  University  of  Delaware,  Newark,  Delaware.  The  thesis  also 
contains  a  discussion  of  storage  saving  considerations  in  the  program¬ 
ming  of  the  procedure. 


.  n 
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The  content  of  the  matrix  at  any  stage  is  as  follows: 


d,  . 

i  J 

-  S.  . 

1  J 

a.  . 

»  J 

II 

a.  . 
i  J 

■  dij 

aij 

a  C.  . 

1  J 

when  neither  X.  nor  X.  are  in  regression 
i  ) 


when  X.  is  in  regression  but  not  X. 
i  J 


when  both  X.  and  X^  are  in  regression. 


CHOOSING  THE  KEY  ELEMENT 


In  forward  stepwise  regression  the  variable  which  is  entered  into 
regression  is  the  one  which  yields  the  greatest  reduction  in  the  variance 
of  residuals  at  that  stage.  For  an  arbitrary  variable  X^  that  is  not  in 

regression  it  is  seen  from  the  recurrence  formula  9.  that  the  variance 
reduction  is  given  by  the  quantity. 

V,  =  a,  a  .  /a.,  as.  s  .  /s.. 

i  in  m  li  in.  q  m.  q  li.  q  (9) 

For  an  arbitrary  variable  X.  that  is  in.  regression  the  variance  increase 

resulting  from  the  removal  of  X.  from  regression  is  given  by  18. 


V.  -  a.  a  .  /a..  =  d  .  b  .  fa... 

l  in  m  li  m.  q  ni.  q  n.  qi 


(10) 


For  X.  not  in  regression  V.  is  positive  and  for  X.  in  regression 
V\  is  negative. 

After  determining  the  key  element  it  is  necessary  to  test  whether 
the  variance  reduction  due  to  entering  the  key  variable  is  statistically 
significant.  By  inspection  of  9.  it  is  seen  that  for  i  =  j  =  n 


s  .  =  s  (l-s  ,  s,  /a  s,.  ) 

nn.  qk  nn.  q  nk.  q  kn.  q  nn.  q  kk.  q 


(11) 


The  quantity  (s  ,  s,  /s  s,  ,  is  defined  as  the  product 

nk. q  kn.  q  nn„ q  kk. q 

moment  coefficient  of  correlation  between  X  and  X, 

n.  q  k.q 
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This  quantity  is  denoted  as  and  is  often  referred  to  as  a  partial 

correlation  coefficient.  Equation  (11)  can  be  written  in  the  form 

2  {li) 
r,  =  s  ,  s,  /s  S  =  (  8  -  s  ,  )/s 

nk.  q  nk.  q  kn.  q'  nn.  q  kk.  q  nn.  q  nn.  qk  nn,  q 

By  inspection  r  ^  gives  the  fractional  variance  reduction  obtained  by 
'  nk.  q 

adding  X  into  the  regression.  If  r  .  is  statistically  different  from 
K  nKe  CJ 

zero,  then  we  observe  that  the  fractional  variance  reduction  due  to  X^ 

is  significant  and  that  X  should  be  brought  into  regression.  For  forward 
2  * 

recursion  r  ^  can  be  computed  directly  from  the  first  expression  of 

(12).  For  backwards  recursion,  i.  e.  ,  to  test  whether  a  variable  X  can 

2  * 
be  removed  from  regression,  r  ^  can  be  computed  from  the  formula 

r2.  *  Vu  /  <  8  u  +  vu  )  (13) 

nk. q  k  nn.  qk  k 

A  test  of  significance  for  r^  is  listed  by  Graybill  [  10],  If  the  true 

coefficient  r  .  ,  for  which  r  ,  is  an  estimate,  is  zero  the  quantity 

nk.  q  nk.  q 

t  =  r  .  (f-2)^  /  (1  -  r 2  )*  (14) 

nk.  q  '  nk.  q 

is  distributed  as  the  Student  t  distribution.  A  test  of  the  hypothesis 

r  ,  ^  0  against  the  alternative  r  ,  =  0  is  performed  as  follows: 

nk.  q  B  nk.  q 

The  quantity  t  is  compared  against  the  one-tailed  t  statistic,  t  (f-2,  c) 

appropriate  to  the  degrees  of  freedom,  f,  and  the  confidence  level,  c. 

The  hypothesis  is  accepted  if  t  >  t  (f-2,  c). 


The  test  is  used  in  two  ways: 


(A)  At  the  beginning  of  a  stage  is  computed  for  all  subscripts  , 
i  =  1,2,...  n-l.  The  largest  positive  V.  identifies  the  key  variable 

which  should  be  tested  for  entering  into  the  regression.  The  quantity 

r  is  computed  using  equation  (12)  and  the  t  test  described  above  is 
nk.  q 

performed.  If  t  >  t  (f-2,  c)  the  variable  X^  is  entered  into  regression  by 
performing  the  transformation  T^. 
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I  X>\  '"PU  _ a.  At  .  .1. _ _  v  .  •  1  • 

'  —  /  *  “w  ^/tt*  i.  VI  w**<o  aio^c  ucgmo  u/y  dg  dill  CUmpUting  V, 

for  all  i.  The  negative  V\  identify  the  variables  that  are  not  in  regression. 

The  negative  V.  of  smallest  magnitude  identifies  the  key  variable  to  test 

for  removal,  r^  is  computed  using  equation  (13).  If  t  >  t  (f-2,  c)  the 

correlation  is  significant  and  the  variable  should  remain  in  regression. 

If  t  <  t  (f“2,  c)  the  variable  can  be  removed  from  regression  without 

significantly  increasing  the  variance  of  residuals.  X,  is  removed  from 

k 

the  regression  by  applying  Tfc  .  The  procedure  is  repeated  until  all 
insignificant  variables  have  been  removed. 

The  modification  of  (A)  and  (B)  above  for  backward  regression  is 
quite  simple.  Initially  the  recursion  is  controlled  to  proceed  all  the  way 
forward,  yielding  the  inverse  of  the  covariance  matrix.  On  the  way  back, 
after  any  variable  is  removed,  the  determination  is  made  as  to  whether  a 
variable  removed  previously  has  become  significant.  If  not,  then  the 
least  significant  variable  in  regression  is  removed,  provided  again  that 
the  resulting  variance  increase  is  not  significant. 


46 


REFERENCES 


1.  Lotto,  G.  ,  On  the  Generation  of  All  Possible  Stepwise  Combinations, 
Mathematics  of  Computation,  Vol.  16,  1962. 

2.  Garside,  M.  J,  ,  The  Best  Sub-Set  in  Multiple  Regression  Analysis, 

Ap  ied  Statistics,  Journal  of  the  Royal  Statistical  Society,  Vol.  XIV, 

1965. 

3.  Breaux,  H.  J.  ,  The  Computation  of  Firing  Tables  for  Guided  Missiles, 
BRL  Report  No.  1348,  November  1966,  Aberdeen  Proving  Ground, 
Maryland. 

4.  Efroymson,  M.  A.  ,  Multiple  Regre s sion  Analysis ,  Mathematical 
Methods  for  Digital  Computers,  Edited  by  Ralston  and  Wilf,  John 
Wiley  and  Sons,  Inc.  I960. 

5.  Breaux,  H.  J.  .  Campbell,  L.  W.  ,  Torrey,  J.  C.  ,  Stepwise  Multiple 
Regression  -  Statistical  Theory  and  Computer  Program  Description, 

BRL  Report  No.  1  330,  July  1966,  Aberdeen  Proving  Ground,  Maryland 

6.  Wilcoxin,  W.  L.  ,  Wohlever,  J.  R.  ,  An  Improved  Stepwise  Regression 
Analysis  Procedure,  Report  No,  Y-F01 5 - 1 5-06-5 1  3 ,  IF  S.  Naval 
Civil  Engineering  Laboratory,  Port  Hueneme,  California. 

7.  Dixon,  W,  J,  ,  Biomedical  Computer  Programs  Health  Sciences 
Computing  Facility,  University  of  California,  Los  Angeles,  September  1, 
1965. 

8.  Abt,  K.  ,  Gemmill,  G,  ,  Herring,  T.  ,  Shade,  R.  ,  DA-MRCA:  A  Fortran 
IV  Program  for  Multiple  Linear  Regression,  Technical  Report  No.  2035, 
U.  S.  Naval  Weapons  Laboratory,  Dahlgren,  Virginia,  March  1966. 

9.  Yule,  G,  U.  ,  Kendall,  M.  G.  ,  An  Introduction  to  the  Theory  of  Statistics, 
Charles  Griffin  and  Company,  London,  1940. 

10.  Graybill,  F.  A.  ,  An  Introduction  to  Linear  Statistical  Models,  Vol.  I, 
McGraw  Hill  Book  Company,  Inc.  ,1961. 


47 


ESTIMATION  OF  ERROR  RATES  IN 
DISCRIMINANT  ANALYSIS* 


Peter  A,  Lachenbruch  and  M.  Ray  Mickey 
University  of  North  Carolina 
Chapel  Hill,  North  Carolina  and 
University  of  California,  Los  Angeles,  California 

ABSTRACT.  Several  methods  of  estimating  error  rates  in  Discriminant 
Analysis  are  evaluated  by  sampling  methods.  Multivariate  normal  samples 
are  generated  on  a  computer  which  have  various  true  probabilities  of 
misclassification  for  different  combinations  of  sample  sizes  and  different 
numbers  of  parameters.  The  two  methods  in  most  common  use  are  found 
to  be  significantly  poorer  than  some  new  methods  that  are  proposed. 


-'This  article  is  to  appear  in  Technometrics. 


SOME  STATISTICAL  APPLICATIONS  IN  THE  TESTING 
OF  MILITARY  VEHICLE  RUBBER  COMPONENTS* 


Emil  H.  Jebe 
Willow  Run  Laboratories 

The  University  of  Michigan,  Ann  Arbor,  Michigan 

SUMMARY.  This  paper  utilizes  the  results  of  four  test  programs  for 
rubber  components  of  military  veh;cles  to  illustrate  a  variety  of  statistical 
applications.  Twc  of  the  programs  were  concerned  with  the  testing  of 
rubber  bushings,  an  element  of  the  track  for  track-laying  vehicles.  A 
third  program  was  conducted  to  evaluate  experimental  types  of  track  pads 
while  the  fourth  example  discussed  reliability  evaluation  for  track  pads  and 
track  shoes. 

Two  of  the  test  programs  were  based  on  experimental  designs  suggested 
by  the  author  while  the  other  two  may  be  described  as; 

(1)  A  factorial  arrangement  for  two  factors  with  missing  treatment 
combinations. 

(2)  A  "road  test"  without  controls  or  any  basis  for  comparative 
evaluation. 

The  statistical  applications  described  for  these  test  programs  include 
the  following; 

a.  Unweighted  least  squares  analysis, 

b.  Orthogonal  polynomials  for  unequal  spacing  of  a  factor, 

c.  Use  of  the  Kronecker  or  Direct  Product  of  matrices  to  form 
the  Contrast  or  Design  Matrix, 

d.  Weighted  Least  Squares  analysis, 

e.  Use  of  a  single  replicate  with  confounding  in  a  3x3x2x2  experi¬ 
ment  for  four  factors , 

f.  Estimation  of  experimental  error  by  a  number  of  techniques, 
e.  g.  ,  regression  residuals,  Half  Normal  Plot,  etc.  , 

g.  Use  of  "uniformity  trial"  analyses  of  data  from  previous  tests, 
to  design  a  new  experiment, 

*Willow  Run  Laboratories,  Project  07  312,  Institute  of  Science  and  Technology, 
The  University  of  Michigan.  Prepared  under  Contract  No.  DA-20- 1 1 3-AMC  - 
05927(T)  with  USATAC,  Warren,  Michigan.  Revised  10  February  1967. 


h.  Reliability  estimation  based  on 

(1)  The  binomial  distribution,  and 

(2)  Johns  and  Lieberman  (1966)  (Technometrics  8,  135,  February 
issue). 

INT RODUCTION.  Since  1962  the  author  has  had  a  unique  opportunity 
to  participate  in  a  number  of  investigations  of  rubber  products  for  military 
applications.  These  studies  have  been  conducted  by  engineers  of  the 
Components  Research  and  Development  Laboratories  (CRDL),  Research 
and  Engineering  Directorate,  USATAC.  My  participation  has  been  through 
several  contracts  between  The  University  of  Michigan  and  USATAC.  Among 
the  types  of  products  investigated  have  been  bushings,  pads,  shoes  and  tires. 
The  latter  needs  no  definition,  but  the  other  three  are  components  or 
elements  of  the  track  for  our  tracked  vehicles,  e.  g.  ,  tanks  and  personnel 
carriers. 

A  few  words  of  non-military  explanation  may  be  helpful  for  these 
components.  The  rubber  bushing  is  bonded  to  a  track  link  pin.  A  close 
fitting  metal  tube  is  squeezed  over  the  rubber  bushings  which  are  bonded 
in  clusters  of  2,  3  or  more  on  the  pin.  This  assembly  is  then  inserted  into 
a  cylindrical  opening  in  the  track  shoe.  Addition  of  center  guides  and  end 
connectors  to  a  group  of  shoes  makes  possible  the  assembly  of  a  complete 
track.  The  rubber  bushing  is  a  key  element  in  this  complete  assembly  in 
that  it  provides  a  non -lubricated  bearing  and  a  load  taking  element  such 
that  the  vehicle  can  travel  at  high  speed  without  prohibitive  noise.  Another 
key  part  of  the  track  is  the  friction  and  load  bearing  surface  between  the 
vehicle  and  the  road.  The  outer  face  of  the  track  shoe  provides  this  sur¬ 
face.  Again  this  face  of  the  shoe  is  made  of  rubber  but  it  may  be  provided 
in  two  ways.  One  way  is  to  bond  and  mold  rubber  to  the  desired  shape 
directly  on  the  steel  surface  of  the  track  shoe.  Another  way  is  to  make 
shoe  pads  of  desired  shape  and  bolt  them  to  the  track  shoe.  The  pad  is 
made  by  bonding  rubber  on  a  metal  plate  with  welded  bolt  attached. 

As  the  author  understands  the  situation,  polymer  science  and  rubber 
technology  are  not  yet  able  to  predict  reliably  the  outcome  of  many  military 
applications.  The  outcome  of  interest  is  durability  or  life  of  the  component. 
Hence,  various  laboratory  and  field  tests  need  to  be  undertaken  to  investi¬ 
gate  the  suitability  and  durability  of  specific  applications.  Our  participation 
in  these  tests  has  comprised; 

(1)  Analysis  of  laboratory  tests  (without  an  experimental  design 
imposed) , 

(2)  Design  of  experiments  for  laboratory  and  field  tests, 
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(3)  Analysis  of  previous  field  tests  to  obtain  information  for 
designing  new  field  tests, 

(a)  Analysis  of  results  from  dcoigncd  ca^ciuhchUi 
(5)  Estimation  of  reliability  from  road  test  results, 


In  presenting  this  paper  the  author  wishes  to  acknowledge  the  contri¬ 
butions  of  his  colleagues  and  co-workers,  R.  A.  King  and  J.  W.  Curtis. 
Further,  the  strong  support,  encouragement  and  active  interest  of  USATAC 
personnel  has  made  it  possible  to  present  this  report*. 

Least  Squares  Analysis  of  a  6x3  Factorial  Arrangement  for  Rubber 
Bushings  with  Missing  Treatment  Combinations.  The  first  problem  pre¬ 
sented  to  me  concerned  the  analysis  of  results  of  fatigue  testing  a  large 
number  of  rubber  bushings  on  a  laboratory  test  machine.  This  machine  is 
designed  to  simulate  the  actual  field  applications  of  the  bushings.  Adjust¬ 
ments  of  the  machine  permit  variations  of  (l)  the  radial  load  (in  psi)  on 
the  bushing,  (2)  the  angle  of  torsional  twist  (plus  or  minus  in  degrees), 
and  (3)  the  cycling  rate  for  the  selected  load  and  angle.  During  fatigue 
testing  the  rubber  deteriorates  so  that  the  load  squeezed  the  bushing  and 
permits  a  carefully  positioned  microswitch  to  close  and  stop  the  machine. 

A  counter  mounted  on  the  machine  permits  recording  the  torsional  cycles 
to  failure  at  the  time  the  switch  closes. 

Engineers  charged  with  analysis  of  these  data  on  cycles  to  failure  were 
disturbed  or  baffled  by  the  tremendous  spread  or  variability  of  the  results. 
Further,  plotting  of  average  results  showed  a  non-linear  response  (Figures 
1  and  2)  which  made  prediction  appear  extremely  hazardous  [l]  **.  Table  1 
indicates  the  variability  for  two  groups  of  tests.  Table  2  provides  a  general 
summary  of  these  results. 

In  approaching  the  analysis  of  these  data,  one  found  that  no  experi¬ 
mental  design  had  been  imposed  on  the  test  sequence.  Although  it  appeared 


,:'In  this  regard  the  author  wishes  to  mention  Messrs.  P.  L.  Goud.C.  Banton, 
C.  D.  Rose,  F.  Spencer,  E.  Kvet,  R.  Westerman,  and  Miss  C.  Cicillini. 
Statements  and  opinions  expressed  in  this  paper,  however,  are  those  of  the 
author  and  do  not  express  USATAC  position  or  policy.  The  author  also 
wishes  to  express  his  appreciation  for  the  comments  of  Professor  H.  ,B. 
Mann,  Army  Mathematics  Research  Center,  Univ.  of  Wisconsin,  made 
after  the  presentation  of  the  paper  on  19  October  1966. 

’■‘^Numbers  in  brackets  refer  to  references.  These  Figures  1  and  2  are 
reproduced  from  Figures  6  and  8  of  Reference  1. 
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Figure  1.  Fatigue  Life  of  Rubber  Hushing  Track  Pins  as  a 
Function  of  Radial  Loading,  at  Diiierent  Det*reeu 
of  Torsional  Twist  (Figure  6  of  [l] ). 
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0  7  1/2  15  22  1/2 

Torsional  Twist,  +  degrees 

Figure  2.  Fatigue  Life  of  Rubber  Bushing  Track  Pins  as  a 
Function  of  Torsional  Twist,  at  Different  Levels 
of  Radial  Loading.  (Figure  8  of  [il). 


TABLE  1. 

Cycles  to  Failure  ior  Rubber  Bushings  Tested  at  Two  Conditions 


Test  No.  20 

Test  No.  24 

1 

44, 900 

1 

800,000 

2 

42, 700 

2 

1, 326,600 

3 

34, 900 

3 

1,  334,900 

4 

32, 600 

4 

1 ,  372, 100 

5 

41, 500 

5 

200,000* 

6 

40, 200 

6 

1, 638,800 

7 

83,  500 

Load  1950  psi 

8 

35, 000 

Angle  +7.5  degrei 

Load 

1  500  psi 

Angl< 

;  +  22.  5  degrees 

Cycling  Rates  for  Both  Groups  -  255  cpm 


Source:  Table  II  of  [l]  . 


"'Rejected  later  as  an  outlier. 


TABLE  2. 


Layout  of  Rubber  Bushing  Test  Conducted  at  USATAC, 
Warren,  Michigan,  1962 


Kadiai  Load 
(psi) 

+  7.  5 

Angle  of  Torsional  Twist  (Degrees) 

+  15.  0  +  22.  5 

1200 

nll=  ° 

(no  data) 

n12  =  6 

C  =  1052* 

n!3  =  8 

C=  56.6 

R  =  423** 

R  =  40 

y  =  30175 

7  =  1. 7373 

s2  =  0.  00505 

s2  =  0. 01503 

1500 

n21  =  ° 

(no  data) 

n22  =  10 

C  =  403 

n23  =  8 

C  =  44.4 

R  =  127 

R  =  51 

7  =  2.4539 

s2  =  0. 00922 

7=  1.  6284 
s2  =  0. 01643 

1800 

n31  =6 

n  =  8 

32 

n33  =  8 

C  =  2661 

C  =  188 

q  =  25.  5 

R  =  2288 

R  =  164 

R  =  8.8 

y  =  3.  4066 

7  =  2.2586 

7  =  1.4035 

s2  =  0, 01906 

s 2  =  0.  01771 

s2  =  0.  00222 

1950 

n41  =  *+ 

C  =  1294 

°42  r  ^ 

(no  data) 

n43  =  0 

(no  data) 

R  =  1439 

* 

7=3.  1006 

s2  =  0.01361 

2100 

n5i  =8 

C  =  339 

11  =  317 
y  =  2. 5021 

n^2  7  ° 

(no  data) 

n.3  =  0 

(no  data) 

s2  =  0.  C2778 
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Table  2  continued 


2250  n, =  8 

o  1 

n62 

n63  =  8 

C  =  342 

C"  =  76 

~C  =  14.  5 

R  =  4?Q 

R  =  55 

R  =  4.  4 

y  =  2.  5014 

y  =  1, 8697 

y=l.  1582 

• 

s2  =  0. 02930 

s2  =  0. 01296 

s2  =  0.  00201 

■C  =  Average  cycles  to  failure  in  ceil  xlO  .  Cycles  recorded  are 
Torsional  Cycles  for  the  Bushing. 

:R  =  Observed  range  for  Cycles  to  failure  in  cell  x  10 

!y. .  =  2  (log  C.  )/n. . ,  k  =  1 ,  2,  .  .  .  n.. 

k  ljk  1J  1J 

■s2  =  2  (y.  -  y)2/(n..-l) 

k  wijk  ij 

+In  Cell  4,1,  one  test  result  was  rejected  as  an  "outlier". 


? 

] 
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that  a  factorial  arrangement  had  been  desired  for  the  factors  Load  (L) 
and  Angle  (A),  such  a  program  was  not  completed.  Table  2  shows  a  6x3 
layout  but  six.  cello  are  empty;  either  no  laiiures  were  obtained  or  no  tests 
were  conducted.  Thus,  a  least  squares  analysis  became  necessary.  Next, 
the  question  of  homogeneity  of  variance  had  to  be  considered.  Clearly, 
differences  in  dispersion  for  treatment  combinations  as  shown  by  Table  1 
should  be  removed.  Without  previous  experience  in  this  field,  the  writer 
selected  the  log  normal  distribution  as  a  plausible  model  for  the  within  cell 
results.  Cells  3,  3  and  6,  3  were  selected  to  take  a  first  look  at  the  results 
of  the  log  transformation.  In  Table  2  the  respective  ranges  in  original 
scale  were  8,800  and  4,400;  the  shown  for  the  transform  are  0.  00202 
and  0,00201.  Corresponding  results  for  cells  3,1  and  6,1  were  2,288,000 
and  490,000  for  ranges  and  s^  of  0.01906  and  0.02930,  respectively. 
Somewhat  encouraged  by  these  results  the  log  transformation  was 
accepted*  [2]  . 


Plotting  the  transformed  data  further  showed  the  usefulness  of  the 
transformation.  Figures  3  and  4  show  the  transformed  results’1**.  It  is 
seen  that  the  response  is  approximately  linear  for  either  factor  for  a 
selected  level  of  the  other  factor.  Some  interaction  between  the  factors 
Load  and  Angle  was  indicated  by  the  non-parallelism  of  the  straight  lines 
sketched  in  the  figures. 


The  next  step  was  selection  of  a  specific  regression  model  and  writing 
out  of  the  X  matrix.  As  a  preliminary  model,  it  was  assumed  that  a  cell 
mean,  y. . ,  could  be  represented  as: 


y..  =  p  X  +  (3  L.  +  fl  ,L2  +  ;3  A  +  (3  A2  +  p  (L.A.)  +«.. 
ij  ro  o  rl  i  11  i  2  j  22  j  12'  i  y  ij 


with  i  =  1,  2,  ....  6  and  j  =  1,  2,  3,  but  not  over  all  i,  j.  The  missing 
cells  reduced  the  data  vector,  y,  to  dimensions  12x1  (the  12  values  are 

shown  in  Table  2).  Small  variations  in  the  n,.  and  variations  in  values 

U 

were  ignored  at  this  stage  so  that  the  were  assumed  to  have  uniform 
variance.  ^ 


*  Later  the  author  became  acquainted  with  some  of  the  relevant  literature 
and  concluded  that  the  procedure  adopted  is  reasonably  robust  against 
certain  alternative  models  [2]  . 

,!’;'Figures  7  and  9  from  Reference  [l]  . 
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Fatigue  Life,  Log  of  Torsional  Cycles 


Figure  3.  logarithm  of  Torsional  Fatigue  Life  as  a  Function  of 
Radial  Load,  at  Different  Degrees  of  Torsional  Twist 
(Figure  7  of  [l]). 


Fatigue  Lile.  Log  erf  Torsional  Cycles 


A  convenient  coding  for  Angle  is  seen  to  be  -1,  0,  +1  since  the  spac¬ 
ing  was  uniform  at  intervals  of  +7.  5  degrees.  This  coding  for  A  also 
uses  the  orthogonal  polynomial  ooafficier.tc  for  the  linear  effci_u  uf  A. 
Similarly,  a  convenient  coding  for  Load  was  found  by  taking  150  lbs  as  the 
unit  and  centering  on  1800  as  zero.  The  coded  values  became  -4,  -l,  0, 
-IT,  +2,  +3.  With  these  values  of  coded  A  and  L,  our  first  X  matrix 
appears  as  in  Table  3. 

TABLE  3.  X  Matrix  for  Preliminary  Model  Fitted  to  Rubber 
Bushing  Data  (Response  =  Average  of  Log  Cycles 
to  Failure) 


X 

o 

L 

L2 

A 

A2 

LA 

1 

0 

0 

-1 

1 

0 

1 

+  1 

1 

-1 

1 

-1 

1 

+2 

4 

-1 

1 

-2 

1 

+  3 

9 

-1 

1 

-3 

1 

-4 

16 

0 

0 

0 

1 

-2 

4 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

+  3 

9 

0 

0 

0 

1 

-4 

16 

+  1 

1 

-4 

1 

-2 

4 

+  1 

1 

-2 

1 

0 

0 

+  1 

1 

0 

1 

+  3 

9 

+  1 

1 

+  3 

T 

From  Table  3,  the  A  matrix  =  X  X  is  obtained  as  Bhown  in  Table  4. 


TABLE  4. 

Cross-Product  Matrix  A 
Equations:  AB  =  G’ 

for  Solution  of  Normal 

X 

0 

L 

L2 

A 

A2 

AxL 

12 

0 

72 

0 

8 

-9 

0 

72 

-54 

-9 

3 

15 

72 

-54 

804 

15 

43 

-81 

0 

-9 

15 

8 

0 

3 

8 

3 

43 

0 

8 

-9 

-9 

15 

-81 

3 

-9 

43 

From  the  solution  of  the  normal  equations  AB  =  G,  which  is  given  by 
B  =  CG,  where  C  =  A-1,  the  regression  coefficients  obtained  are  shown 
in  Table  7  under  First  Equation.  The  summary  analysis  of  variance 
appears  in  Table  5. 

TABLE  5.  Analysis  of  Variance  for  Fitting  Preliminary 
Model  to  Mean  of  Log  of  Cycles  to  Failure 


Source  of  Variation 

Degrees  of 
Freedom 

Sum  of 
Squares 

Total 

12 

66.4389 

Mean  (correction  term) 

1 

60. 9171 

Reduction  in  Sum  of  Squares 
for  Regression 

5 

5. 3943 

Remainder 

6 

0. 1275 

Within  Cells  (from  Table  2) 

77 

0. 18268* 

Extension  of  Table  5  (based  upon  fitting  the 
matrix  given  in  Table  8): 

Z  model  with  design 

Add  Reduction  in  S.  S. 

3 

0. 07387 

Remainder 

3 

0. 05363 

*In  this  compact  notation,  G  =  X  Y  where  Y  is  the  vector  of  means  given 
in  Table  2  in  six  rows  and  three  columns. 

**The  actual  within  cells  sum  of  squares  was  1.096091;  a  divisor  of  6  has 
been  used  to  place  the  Remainder  SS  and  Within  Cells  SS  on  a  comparable 
basis. 


63 


The  results  presented  above  are  incomplete  or  inadequate  in  three 
respects.  First,  the  regression  coefficients  are  correlated;  one  would 
like  orthogonal  estimates  of  the  effects  of  Load  and  Angle  and  their 
interaction.  Second,  tne  Kemamder  Mean  Square,  0.0212,  obtained  from 
Table  5,  when  compared  with  the  Within  Cells  Mean  Square,  0.  00237, 
indicates  a  lack  of  fit  for  the  regression  equation  used  (F  value  -  8.9  with 
6  and  77  degrees  of  freedom).  The  Within  Cells  Mean  Square  used  here 
may  be  an  underestimate  of  the  proper  experimental  error  due  to  the  lack 
of  randomization  in  this  test  program.  Third,  there  is  the  homogeneity 
of  variance  problem  already  noted  in  relation  to  Table  2. 

In  considering  the  first  point,  non-orthogonality  of  the  estimates, 
one  possible  approach  might  be  to  use  a  "Missing  Value"  formula  and  fill 
in  the  six  empty  ceils.  Without  blocking  applied  in  the  experiment,  the 
standard  formula  for  any  experimental  design  could  not  be  used  to  fill  in 
the  missing  treatment  combinations.  Rather  naively  at  the  time,  I  assumed 
that  plausible  estimates  might  be  obtained  by  applying  the  Randomized 
Complete  Blocks  formula  for  a  missing  datum  to  the  rows  and  columns 
of  the  two-way  layout  for  the  factors  Load  and  Angle,  By  iterative  applica¬ 
tion  of  this  formula,  the  six  empty  cells  were  filled.  Then  a  second 
regression  equation  was  obtained.  It  was  found,  however,  that  predictions 
from  this  recond  equation  were  much  worse  than  for  the  first  equation. 

For  the  same  12  observed  points,  the  sum  of  squares  of  deviations  was 
0.  33025,  about  three  times  the  remainder  sum  of  squares  of  0.  12750  shown 
in  Table  5  for  the  First  Equation. 

Why  was  this  decrease  in  "goodness  of  fit"  observed  even  though  we 
now  had  orthogonally  estimated  regression  coefficients  (given  in  Table  7 
under  the  column  headed  Second  Equation)?  If  there  had  been  only  one  or 
two  missing  treatment  combinations,  perhaps,  the  results  would  have  been 
satisfactory.  The  consequence  of  the  application  of  the  Randomized  Com¬ 
plete  Blocks  missing  value  formula  to  be  Load-Angle  two-way  table  was  to 
minimize  the  Load  x  Angle  interaction.  This  interaction  has  10  degrees 
of  freedom  in  this  Load-Angle  table  but  due  to  the  six  empty  cells  only  four 
degrees  of  freedom  can  be  estimated.  Filling  in  the  empty  cells  by 
minimizing  these  four  degrees  of  freedom  apparently  had  distorted  the 
response  surface  so  that  the  goodness  of  fit  achieved  by  the  First  Equation 
was  destroyed.  This  view  of  the  problem  is  supported  by  a  re-examination 
of  Figures  J  and  4,  which  indicate  some  interaction  that  may  be  largely  the 
Load  linear  by  Angle  linear  component,  and  Table  7.  In  the  latter,  the 
values  for  the  b  (linear  by  linear)  regression  coefficient  are  +0.  1038 

and  +0,0278,  respectively,  for  the  equations  being  compared.  This  reduc¬ 
tion,  by  a  factor  of  four  almost,  in  this  component  of  interaction  regression 
coefficient  appears  to  be  due  to  the  minimization  of  the  overall  interaction. 
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These  unsatisfactory  results  for  the  second  equation  posed  a  dilemma 

r  _  ^  rr i  j  _ _  :  „  r - -  *»■.*+■ *.  *%  *•.  A  ~  ~ 

AV  *.  111C  ■  HUVY  euuiu  1UUA  »-  WJ*>*  V/  *  a  »  —  »-  '*»*<'«■  » 

Discussions  with  Professor  Paul  Dwyer*  brought  out  two  suggestions  from 
him.  He  did  remark  that  trying  to  supply  one -third  of  the  observations 
by  the  missing  value  approach  is  "too  much  like  trying  to  pull  yourself 
up  by  your  own  bootstraps".  Essentially,  his  suggestions  were  to  make 
sub-analyses  using  subsets  of  the  twelve  observed  points  to  form  orthogonal 
structures.  The  data  points  used  for  these  analyses  are  shown  in  Table  6. 

TABLE  6.  Data  Points  Used  for  Orthogonal  Sub-analyses  of 
Rubber  Bushing  Fatique  Life  Data 

Angle  of  Torsional  Twist 

First  Sub-analysis  (8  points) 


Load 

+  7.  5 

+  15.0 

+  22.  5 

1200 

0 

X 

X 

1500 

0 

X 

X 

1800 

- 

X 

X 

1950 

- 

0 

0 

2100 

- 

0 

0 

2250 

“ 

X 

Second  Sub-analysis  (6  points) 

X 

1200 

0 

- 

- 

1500 

0 

- 

- 

1800 

X 

X 

X 

1950 

- 

0 

0 

2100 

- 

0 

0 

2250 

X 

X 

X 

Code:  0  indicates  missing  values 

-  datum  observed  but  not  used 
x  datum  used  for  analysis 

* 

Observed  values  appear  in  Table  2. 

’^'Department  of  Mathematics  and  Statistical  Research  Laboratory,  The 
University  of  Michigan,  Ann  Arbor,  Michigan. 
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The  results  of  these  sub-analyses  are  presented  in  terms  of  regression 
coefficients  for  the  "Third  Equation"  and  "Fourth  Equation"  in  Table  1.  -' 

A  study  of  Table  <  shows  that  these  sub-analyses  support  the  results  for  the 
First  Equation.  Perhaps,  one  should  be  criticized  at  this  point  for  not 
presenting  standard  errors  of  the  regression  coefficients.  The  regression 
model  of  the  First  Equation  gave  such  a  good  fit  and  signs  of  the  coefficients 
•were  proper  so  this  model  was  accepted  and  a  report  written  [l]  .  Further, 
extrapolations  attempted  by  the  test  engineer  from  these  accelerated  test 
results  and  the  regression  model  gave  plausible  results. 

Personally,  I  was  not  yet  satisfied  and  I  continued  to  think  about  how 
to  improve  the  analysis.  If  the  data  had  been  complete,  one  could  have 
worked  out  the  orthogonal  polynomial  values  for  the  unequal  spacing  on 
Radial  Load  [3]  .  Forming  the  Kronecker  Product  of  the  Contrast  Matrices 
for  Radial  Load  and  Angle  of  Torsional  Twist  would  then  have  given  an 
18  x  18  contract  matrix  for  a  complete  analysis  in  terms  of  single  degrees 
of  freedom.  From  this  view,  it  occurred  to  me,  "Why  not  proceed  in  this 
way  to  obtain  the  design  matrix  for  the  12  observed  points?"  Details  are 
omitted  but  the  resulting  matrix  is  given  in  Table  8.  Here  is  it  seen  that 
additional  interaction  terms  have  been  added  to  the  model  over  the  First 
Equation  whose  design  matrix  was  given  in  Table  3.  If  we  designate  this 
matrix  in  Table  8  as  Z,  then  a  comparison  of  Z^Z  with  X^X,  given  above 
in  Table  4,  provides  some  basis  for  evaluating  the  fifth  approach  to  the 
analysis.  The  matrix  ZTZ  in  terms  of  its  first  6  rows  and  6  columns  is 
given  in  Table  9,  for  making  this  comparison. 

It  appears  that  most  of  the  off-diagonal  elements  shown  in  Table  9  have 
been  reduced  in  relative  magnitude.  Transformation  of  Tables  4  and  9  to 
the  correlation  matrices  shows  explicitly  that  the  dependence  among 
predictors  has  been  reduced**.  What  this  means  is  that  use  of  the  Z  matrix 
will  give  regression  coefficients  that  are  less  correlated  than  the  coefficients 
obtained  in  the  first  equation.  The  -egression  coefficients  obtained  by  use 
of  Z  appear  in  Table  7  under  "Fifth  Equation".  Extension  of  Table  5  to 
include  the  Z  model  shows  an  added  reduction  in  sum  of  squares  of  0.07  387 
with  3  degrees  of  freedom  leaving  a  new  Remainder  S.  S.  of  0.05363  with 
3  degrees  of  freedom. 

Setting  aside  temporarily  the  inadequacy  of  goodness  of  fit  noted  on 
page  64  ,  we  consider  the  homogeneity  nf  variance  situation.  Even  though 
much  improved  by  the  logarithmic  transformation,  it  is  still  apparent  in 

*Table  X  of  Enclosure  23  [l]  . 

**These  correlation  matrices  have  been  omitted  from  this  paper, 
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TABLE  7. 


List  of  Regression  Coefficients  Obtained  by 
the  Various  Analyses 


First 

Second 

Third 

F  ourth 

Fifth 

Term 

Equation 

Equation 

Equation 

Equation 

Equation 

b 

0 

2. 2494 

2. 2364 

1. 9539 

2. 0997 

2.  5243 

bL 

-0.  1759 

-0. 1526 

-0. 1231 

-0. 1710 

-0.2950 

b  A 

-0. 9395 

-0. 8768 

-0. 9442 

-0. 8366 

-1  .  2321 

bL.L 

+0. 0025 

-0. 0083 

+0. 0042 

— 

+0.  0294 

bAA 

+0. 0994 

-0. 0225 

— 

+0. 0178 

+0.  1272 

bAL 

+0. 1038 

linear  by  linear 

+0. 0278 

+0. 0753 

+0. 1100 

+0.  2780 

bLAA 

linear  by  quadratic 

+0. 0109 

... 

— 

-0.  0682 

bLIA. 

auadratic 

x  linear 

+0. 0105 

-0. 0120 

-0. 0207 

-0.  0391 

bLLAA 

quadratic 

+0.0071 

x  quadratic 

— 

— 

+0.  0078 
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TABLE  8.  Design  Matrix  Based  on  Forming  Orthogonal  Polynomials 


For 

Load  and 

Anorl 

_  Rn KK u 

R  1 1  n  U  i  *i  /v 
-  — 

HI  'ipc  x* 

.,  --  i. 

out 

Lin. 

Quad. 

Lin. 

Quad. 

LxL* 

LxQ 

QxL 

QxQ 

X 

o 

X, 

X1  1 

X2 

X22 

X1X2 

X1X22 

X11X2 

X11X22 

Load 

Angl 

t 

Interactions 

1 

0 

-5. 667* * 

-1 

+  1 

0 

0 

5.  667 

-5. 667 

1 

1 

-3. 608 

-1 

+  1 

-1 

+  1 

3.  608 

-3.  608 

1 

2 

+Q. 451 

-1 

+  1 

-2 

+2 

-0. 451 

+0. 451 

1 

3 

+6. 510 

-1 

+  1 

-3 

+  3 

-6.  510 

+  6.  510 

1 

-4 

+6. 098 

0 

-2 

0 

+8 

0 

"12. 196 

1 

-2 

-3.  784 

0 

-2 

0 

+4 

0 

+7.  568 

1 

0 

-5. 667 

0 

_2 

0 

0 

0 

+11.  334 

1 

+  3 

+6.  510 

0 

-2 

0 

-6 

0 

-1  3.  020 

i 

-4 

+6. 098 

+  1 

+  1 

-4 

-4 

6.  098 

+  6.  098 

1 

-2 

-3. 784 

+  1 

+  1 

-2 

-2 

-3.  784 

-3. 784 

1 

0 

-  5 .  667 

+  1 

+  1 

0 

0 

-5.  667 

-5.  667 

1 

+  3 

+  6.  510 

+  1 

+  1 

-1  3 

•13 

+6,  510 

+  6.  510 

LxL  =  linear  by  linear 
LxQ  =  linear  by  quadratic 
QxL  =  quadratic  by  linear 
QxQ  =  quadratic  by  quadratic 

See  referei  co  3  for  computation  o£  values  in  this  column. 
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T 

TABLE  9.  The  Matrix  Z  Z  with  Last  Three  Rows 
and  Columns  Deleted 


12.  0 

0 

4.  0 

0 

0 

-9.  0 

0 

72.0 

22.24 

-9.  0 

+9.  0 

+  15.  0 

4.  0 

22.24 

339.71 

+  5.  47 

-5.47 

-14.  12 

0 

-9.0 

+  5.47 

+8.  0 

0 

+3.0 

0 

+9.0 

-5.47 

0 

+24.  0 

-9.0 

-9.0 

+  15.0 

-14. 12 

+3.  0  ' 

-9.0 

43.  0 
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he  re  fore,  a. 


Table  2  that  the  y  differ  considerably  in  precision.  T 

ij 

weighted  least  squares  analysis  is  indicated.  Two  methods  of  weighting 

were  used.  One  method  used  the  individual  s!".  shown  in  Table  2,  i.e.  , 

ij 

0.01906,  0.01361,  etc.,  with  w,.=n../s?.  .  The  other  method  pooled  sums 

ij  AJ  lJ 

of  squares  for  the  cells  where  the  s..  were  similar  in  magnitude,  and 

obtained  a  set  of  three  values  of  s^  to  be  used  with  the  n..  of  Table  2  to 

2  P  O 

obtain  the  set  of  w. .  =  n.  ,/s 

ij  IJ  P(ij) 

Calculations  using  these  two  sets  of  weights  just  described  were 
repeated  with  the  Z  model  already  given  above.  The  regression  coeffi¬ 
cients  obtained  are  presented  in  Table  ?A  (Sixth  Equation  results  are  for 
use  of  the  individual  s^.  ■,  Seventh  Equation  results  refers  to  the  pooling 

1J  2 

method  to  obtain  only  three  different  values  of  s  used  in  forming  the 
weights).  It  is  seen  that  the  regression  coefficients  obtained  by  the  two 
different  weighted  least  squares  analyses  are  quite  similar;  differences 
observed  are  less  than  or  of  the  order  of  the  standard  errors  of  the  differ¬ 
ences.  Further  comparison  of  the  regression  coefficients  with  those 
obtained  for  the  Fifth  Equation,  given  in  Table  7,  reveals  some  differences 
that  may  be  judged  statistically  significant.  In  terms  of  practical  applica¬ 
tion  for  making  predictions  of  bushing  fatigue  life  there  may  be  little  to 
choose  between  these  three  equations.  In  view  of  the  somewhat  more 
reliable  weights  used  to  obtain  the  Seventh  Equation  results,  a  statistical 
choice  would  lead  to  this  equation,  other  things  being  equal. 

The  results  for  the  weighted  regression  analyses  also  permit  comment 
on  the  goodness  of  fit  issue,  which  was  deferred  above.  The  lower  section 
of  Table  7A  displays  the  Residual  Mean  Squares  for  the  Fifth,  Sixth  and 
Seventh  Equations.  With  only  three  degrees  of  freedom  available  for  esti¬ 
mating  these  quantities,  no  sharp  judgments  can  be  made.  Qualitatively, 
the  weighted  analysis  has  reduced  the  residual  variation  by  more  than  a 
factor  of  two,  and  the  goodness  of  fit  has  clearly  been  improved.  Yet  the 
ratio  of  Residual  to  Within  Cells  is  still  large  (P<  0.  05).  If  one  does  regard 
the  Within  Cells  as  an  underestimate  of  the  experimental  error  as  noted 
above,  then  one  may  conclude  that  a  satisfactory  fit  has  been  obtained  with 

2  2 

::The  Sp^j)  indicates  the  value  of  s  used  for  each  cell  after  the  pooling 

operation.  The  author  is  indebted  to  Ralph  A.  King  for  this  suggestion 
for  obtaining  more  reliable  weights.  The  sample  weights  obtained  by  the 
two  methods  appear  to  be  the  best  surrogates  available  for  the  <r?,  which 
are  unknown.  ■* 
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TABLE  7A,  Regression  Coefficients  Obtained  by  Weighted 
Least  Squares  Analysis  Using  Model  Z  and 
Residual  Mean  Squares  for  Three  Equations 


Regression  Coefficient 

Sixth  Equation 

Seventh  Equation 

b 

o 

+2.4982 

+2.4980 

bL 

-0.2830 

-0.  2805 

bA 

-1. 2093 

-1. 2002 

bLL 

+0. 0282 

+0. 0272 

bAA 

+0. 1217 

+0. 1140 

bAL 

+0.2596 

+0. 2552 

bLAA 

-0.0618 

-0. 0601 

bLLA 

-0.0331 

-0. 0328 

bLLAA 

+0.0052 

+0.  0056 

Residual  Mean  Squares 

Source 

Degrees  of  Freedom 

Mean  Square 

Fifth  Equation 

3 

0. 01787 

Sixth  Equation 

3 

0.00693 

Seventh  Equation 

3 

0. 00740 

Within  Cells 

77 

0. 00237 

,1! 


the  weighted  analysis.*  This  concludes  the  story  on  the  first  rubber 
bushing  analysis. 

Design  ami  Analysis  of  a  3x3x2x2  Experiment  on  Rubber  Pushings. 

After  completion  of  the  earlier  work  described  above,  an  opportunity  arose 
to  design  an  experimental  program  for  learning  more  about  rubber  bushings. 
At  first,  a  rather  ambitious  program  was  considered  which  would  have 
involved  "experiments  with  mixtures "  (Refer  Scheffe  [4  ]  and  [5]  and  more 
recent  papers  in  Technometrics).  Suitable  bushings  prepared  from  mix¬ 
tures  c*f  natural  and  synthetic  rubbers  could  not  be  obtained  at  the  time. 
Other  parameters  to  be  varied  in  the  experiment  may  be  described  as 
Process  variables  and  Test  variables.  It  was  desired  to  retain  two  levels 
each  of  Radial  Load  and  of  Angle  of  Torsional  Twist  to  provide  a  check  on 
the  results  for  these  factors  as  reported  above.  These  were  the  Test 
variables.  As  Process  variables,  three  levels  each  of  Cure  Temperature 
and  Cure  Time  for  production  of  the  bushings  were  to  be  tested.  Thus, 
the  factorial  arrangement  became  a  3x3x2x2  which  requires  36  tests  for 
a  single  replicate.  Two  replicates  would  have  required  72  bushings  to  be 
tested  which  I  regarded  as  too  large  an  experiment.  After  some  thought  I 
recommended  a  single  replicate  to  be  carried  out  in  a  completely  ran¬ 
domized  design.  At  this  point  the  problems  began.  Complete  randomiza¬ 
tion  for  the  production  and  testing  of  the  36  rubber  bushings  was  regarded 

*The  author  believes  that  some  comment  on  rows  2  and  3  versus  row  1 
of  the  lower  part  of  Table  7A  may  be  helpful.  Many  texts  describe 
weighted  regression  analysis  but  none  with  which  I  am  familiar  include 
a  discussion  on  comparison  with  the  unweighted  analysis.  With  the  w^ 
values  defined  as  explained  above,  the  author  was  confronted  with  residual 
sums  of  squares  for  the  Sixth  and  Seventh  Equations  that  apparently  pro¬ 
vided  no  basis  for  comparison  with  the  figure  given  in  Table  5  as  0.  05363 
with  3  degrees  of  freedom.  Understanding  came  finally  in  appreciating 
the  difference  in  metric.  While  all  three  sums  of  squares  represent 
Euclidean  distances  in  n- space,  the  scale  was  different  for  each.  The 
so-called  "unweighted"  least  squares  analysis  in  reality  has  a  sum  of 
weights  equal  to  n,  12  in  this  problem.  Hence,  it  was  necessary  to  re-scale 
the  residual  sums  of  squares  for  the  weighted  analyses  by  the  factor 
n/2  £  wjj  or  12/2  Zwjj.  These  open  problems  ,  the  choice  of  scale,  the 
estimation  of  weights,  and  more  generally,  the  broader  problem  of  trans¬ 
formation  of  response  to  obtain  an  optimal  analysis  appear  to  merit 
continuing  attention. 


as  impractical,  too  costly,  and  too  time  consuming.  My  arguments  for 
complete  randomization  did  not  convince  the  engineers  that  it  should  he 
adopted.  Then  we  started  to  examine  possible  compromises.  Complete 
randomization  for  Cure  Time,  Radial  Load  and  Angle  of  Torsional  Twist 
could  be  carried  out.  Cure  Temperature  involved  bringing  the  cure  press 
(heated  Platten  Press)  to  the  desired  temperature  and  holding  it  there 
for  the  ncessary  Cure  Time.  The  engineers  wanted  to  reduce  the  number 
of  times  for  a  press  cycle  to  a  minimum.  Now,  the  Cure  Temperature 
could  have  been  made  a  Main  Plot  treatment  in  a  Split-Plot  design  with 
the  3x2x2  arrangement  utilizing  12  split-plots.  Replication  on  Cure  Tem¬ 
perature  would  then  have  forced  the  total  size  of  the  experiment  back  to 
at  least  72  bushings. 

At  this  stage  it  appeared  to  me  that  some  type  of  replication  for  Cure 
Temperature  must  be  included  in  the  test  program.  A  study  of  Kemp- 
thorne's  book  provided  a  possible  solution  (Reference  6).  The  12  split- 
plot  treatments  were  divided  into  two  main  plots  of  six  split-plots  each  by 
confounding  the  Load  by  Angle  (linear  x  linear)  interaction  with  the  main 
plots’'1.  This  contruction  of  the  design  required  only  six  press  cycles  with 
six  bushings  cured  in  each  run,  two  at  each  of  the  three  Cure  Times. 

Analysis  of  the  resulting  data  when  this  test  program  had  been  com¬ 
pleted  was,  of  course,  considerably  more  complicated  than  that  outlined 
by  Kempthorne  since  we  imposed  an  added  factor  at  three  levels  (refer 
pp.  351  -355  of  (6)  ).  Details  are  given  in  Reference  7. 

Here,  I  shall  only  try  to  describe  some  of  the  major  features  of  the 
analysis  and  interpretation.  The  actual  layout  of  the  program  for  the  36 
experimental  units  is  given  in  Table  10.  It  is  to  be  noted  that  I  insisted 
on  equal  spacings  for  the  three  levels  factors:  Cure  Temperature  at  306, 
315  and  324  degrees  F.  and  Cure  Time  at  15,  30  and  45  minutes.  Such 
equal  spacings  make  the  analysis  much  easier  but  should  not  be  required 
for  all  test  programs. 

Our  first  approach  to  the  analysis  was  to  write  out  a  Design  Matrix 
that  included  the  General  Mean,  Blocks,  all  main  effects  and  all  two-factor 
interactions.  Full  column  rank  was  maintained  for  this  matrix  by  the  usual 
devices,  orthogonal  polynomials  for  the  contrasts  and  subtraction  of  the 

*See  Table  18.5,  p.  35C,  [6]  ,  The  three  Cure  Temperatures  were  * 
randomly  assigned  to  the  Replicates  shown  in  the  table  and  the  main 
plots  in  each  block  for  Cure  Temperature  became  the  sets  shown  as 
"Block  1"  and  "Block  2"  by  Kempthorne. 
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TABLE  10.  STRUCTURE  AND  RANDOMIZATION  LAYOUT  FOR 


THE  SPLIT -PLOT  DESIGN  WITH  CONFOUNDING. 

Nuniuers  m  parcnlncbcb 

refer  t-u 

replicaies  and  blocks 

in  Kempthorne.  [6] 

Block  1 

Block  2 

Teir.pe 

rature  306  (2-2) 

Temperature 

315  (3-2) 

t 

L 

A 

t 

L 

A 

1 

0 

1 

13 

1 

1 

1 

27 

1 

1 

0 

14 

0 

1 

1 

28 

2 

0 

0 

18 

0 

0 

0 

25 

2 

1 

1 

17 

2 

0 

1 

29 

0 

1 

1 

15 

2 

1 

0 

26 

0 

0 

0 

16 

1 

0 

0 

30 

Te 

mperature 

315  (3-1) 

Temperature 

324  (1-1) 

t 

L 

A 

t 

L 

A 

0 

0 

1 

3 

2 

0 

1 

33 

2 

1 

1 

5 

0 

0 

0 

35 

1 

1 

0 

1 

0 

1 

1 

34 

0 

1 

0 

2 

2 

1 

0 

31 

2 

0 

0 

4 

1 

1 

0 

36 

1 

0 

1 

6 

1 

0 

1 

32 

Temperature 

324  (1-2) 

Temperature 

i  306  (2-1) 

t 

L 

A 

t 

L 

A 

2 

1 

1 

10 

1 

0 

0 

19 

1 

0 

0 

12 

1 

1 

1 

20 

2 

0 

0 

7 

0 

1 

0 

23 

0 

1 

0 

9 

0 

0 

1 

24 

0 

0 

1 

8 

2 

1 

0 

22 

1 

1 

1 

11 

2 

0 

1 

21 

Each  group  of  six  rubber  bushing  receives  the  cure  temperature  indicated. 
Two  groups  at  the  same  temperature  form  the  complete  set  of  12  =  3x2x2 
for  the  split-plot  treatments.  For  t,  L,  A  the  symbols  designate, 
respectively,  0  =  15  min.  ,  1  =  30  min.  ,  2  =  45  min.  ;  0  =  180^,  1  =  220^; 

0  =  jt  7.  5  degrees,  1=^9  degrees.  Mote  t  =  Cure  Time,  L  =  Radial  Load, 
and  A  =  Angle  of  Torsional  Twist.  The  4th  column  in  each  grouping  shows 
the  randomization  order  for  taking  the  observations  over  the  entire 
experiment. 
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column  for  block  2  from  the  column  for  block  1.  The  resulting  matrix 
was  36  x  21.  Least  Squares  was  then  applied  to  estimate  these  21  effects 

or  their  regression  coefficients.  Again,  the  transformation  Y.  =  log  C. 

i  j 

(where  C.  is  cycles  to  Failure)  was  employed.  Original  data  and 
logarithms  to  base  10  appear  in  Table  11.  The  regression  coefficients 
obtained  are  listed  in  Table  12. 

An  interpretation  of  these  results  is  given  by  quoting  three  paragraphs 
from  reference  7; 

"From  the  analysis  of  variance  we  may  deduce  that  the  regression 
equation  comprising  blocks,  main  effects  and  two-factor  interactions 
provides  a  good  fit  to  the  data.  A  little  over  97%  of  the  total  varia¬ 
tion  about  the  mean  is  associated  with  these  effects  leaving  only  about 
3%  of  this  total  as  residual  variation. 

"F rom  the  inverse  matrix  (obtained  in  the  course  of  the  regres¬ 
sion  computations)  it  is  found  that  all  of  the  effects  listed  in  Table  3 
are  orthogonal  (i.  e.  ,  independent)  except  Blocks  and  the  Load  x 
Angle  interaction.  These  two  effects  have  a  small  correlation  and 
are  independent  of  the  other  19  effects  listed.  Further,  the  diagonal 
elements  of  the  inverse  matrix,  c^,  are  the  elements  needed  for 
obtaining  the  standard  errors  of  the  regression  coefficients. 
Specifically,  the  standard  errors  are  given  by  (cii)^/^se ,  where  s 

c 

is  the  standard  deviation  of  the  residuals,  given  as  0.  0908.  These 
standard  errors  range  from  about  0.012  to  0.023.  Thus,  it  is 
found  that  Radial  Load,  Angle  of  Torsional  Twist  and  Cure  Time 
(Linear)  which  show  the  largest  effects  in  relation  to  their  sampling 
errors,  should  be  regarded  as  real  or  significant  effects.  On  the 
other  hand,  the  Cure  Temperature  (Linear)  coefficient  is  slightly 
smaller  than  twice  its  standard  error;  thus,  it  may  be  regarded  as 
a  significant  effect.  Interestingly  enough,  two  of  the  interaction 
coefficients  are  fairly  large  in  relation  to  their  sampling  errors. 

These  are  Temperature  x  Time  (Linear)  and  Time  (Linear)  x  Angle. 

"From  the  signs  of  the  regression  coefficients,  one  may  obtain 
the  direction  of  the  effect.  Cure  Temperature  has  a  positive 
coefficient  so  we  conclude  that  a  higher  temperature,  i.e,  ,  324 
degrees  F.  ,  is  to  be  preferred.  The  quadratic  coefficient  for 
temperature  is  negative  which  is  to  be  expected.  Turning  to  Cure 
Time,  the  coefficient  is  negative  so  that  a  shorter  cure  time  is  best, 
i.e.  ,  15  minutes.  Here  the  quadratic  coefficient  is  positive,  but 
not  reliably  estimated.  The  Load  and  the  Angle  coefficients  are  both 
negative  as  expected;  thus,  increasing  the  level  of  either  shortens 
the  fatigue  life.  " 
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TABLE  11.  FATIGUE  LIFE  OF  RUBBER  BUSHINGS.  Original 

Data-Cycles  to  Failure  and  Logarithms  of  these  Values. 


Bushing 

No. 

Cycles 

Logarithm 

Bushing 

No. 

Cycles 

Logarithm 

1* 

114, 300 

5.  0580** 

19 

279,900 

5.4470 

2 

173, 300 

5.  2388 

20 

36,000 

4.  5563 

3 

134, 100 

5. 1274 

21 

72,600 

4.  B6u9 

4 

246,900 

5.  3925 

22 

109,700 

5.  0402 

5 

36, 600 

4. 5635 

23 

154,500 

5.  1889 

6 

119,200 

5.  0763 

24 

123, 300 

5.  0910 

7 

194,400 

5. 2887 

25 

459, 100 

5.  6619 

8 

127,000 

5. 1038 

26 

59,500 

4.  7745 

9 

231,100 

5. 3638 

27 

42,000 

4.  6232 

10 

32,000 

4. 5052 

28 

67,800 

4.  8312 

11 

27,700 

4.4425 

29 

79,000 

4.8976 

12 

279,200 

5.4459 

30 

257, 100 

5.  4101 

13 

60,200 

4. 7796 

31 

77,500 

4.  8893 

14 

98,800 

4. 9948 

32 

132,000 

5.  1206 

15 

41, 300 

4. 6160 

33 

88,700 

4.  9479 

16 

416,000 

5.  6191 

34 

63, 600 

4.  8035 

17 

34,700 

4.  5403 

35 

722,200 

5.  8587 

18 

254, 600 

5.4059 

36 

165, 700 

5.2193 

''Treatments  applied  to  each  of  these  bushings  were  shown  in  Table  10  - 
refer  corresponding  numbers,  column  4  of  each  main  plot  set, 

“•'^Tabulated  here  with  only  four  decimals  in  the  mantissae.  The  computer 
obtained  natural  logarithms  which  were  converted  to  common  logarithms 
for  ease  of  interpretation. 


TABLE  12.  REGRESSION  COEFFICIENTS  OBTAINED  FROM 
LEAST  SQUARES  ANALYSIS  OF  LOGARITHMS  OF 
CYCLES  TO  FAILURE  FOR  RUBBER  BUSHINGS 


Coefficient1'*' 

Name  of  Effect 

1 

5. 049560 

General  Mean 

2 

0. 019315 

Blocks 

3 

0. 035382 

Cure  Temperature  -  Linear 

4 

-0. 002519 

-  Quadratic 

5 

-0. 141561 

Cure  Time  -  Linear 

6 

0.  017546 

-  Quadratic 

7 

-0. 202377 

Radial  Load  -  Linear 

8 

-0. 244741 

Angle  of  Torsional  Twist 

9 

-0. 051940 

Temp.  (Linear)  x  time  (Linear) 

10 

-0. 010474 

Temp.  (Linear)  x  time  (Quad) 

1 1 

0. 006171 

Temp.  (Quad)  x  time  (Linear) 

12 

0. 005603 

Temp.  (Quad)  x  time  (Quad) 

13 

-0. 011462 

Temp.  (Linear)  x  Load 

14 

0. 002001 

Temp.  (Quad)  x  Load 

15 

-0. 002537 

time  (Linear)  x  Load 

16 

-0. 001801 

time  (Quad)  x  Load 

17 

0. 004561 

Temp  (Linear)  x  Angle 

18 

-0. 021679 

Temp.  (Quad)  x  Angle 

1? 

0. 036779 

time  (Linear)  x  Angle 

20 

0. 001657 

time  (Quad)  x  Angle 

21 

0. 008773 

Load  x  Angle 

’i'lt  is  to  be  noted  that  the  magnitude  of  these  coefficients  depends  on  the 
scale  of  the  effect  used  in  fitting  the  regression.  Thus,  Blocks  were 
coded  as  -1  and  +1;  Temperature  was  coded  as  one  unit  =  9  degrees  F.  ; 
time  was  coded  as  one  unit  =  15  minutes;  one  unit  of  Radial  Load  =  20(20) 
and  one  unit  of  Angle  of  Torsional  Twist  as  0.  75  degrees. 
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Further  effort  in  the  analysis  of  these  data  was  devoted  to;  (1)  Estima¬ 
tion  of  the  main  plot  experimental  error  for  Cure  Temperature,  and  (2) 
estimation  of  the  split-plot  experimental  error  by  various  methods.  It  is 
true  that  the  regression  residual  sum  of  squares  0.  12  3566  with  15  degrees 
of  freedom  giving  a  mean  square  of  0.  008238  is  an  estimator  of  experi¬ 
mental  variation  under  appropriate  assumptions  but  it  is  still  a  mixture  of 
the  main  plot  and  split-plot  components  just  mentioned.  Hence,  the 
sentences  just  quoted  may  not  be  valid  statements  for  judging  the  Cure 
Temperature  effects. 

If  blocks  are  ignored,  it  is  possible  to  estimate  each  of  the  35  individual 
degree  of  freedom  effects  because,  of  the  balanced  structure  for  the  factorial 
arrangement.  Actually  somewhat  more  is  ignored  because  of  the  structure 
of  the  main  plots  for  Cure  Temperature;  some  of  the  higher  order  inter¬ 
actions  for  Cure  Temperature  are  confounded  with  blocks.  To  obtain  the 
sum  of  squares  for  each  of  these  35  effects  the  full  contrast  matrix  was 
prepared  on  the  computer  by  forming  the  Kronecker  or  Direct  Product  of 
the  individual  contrast  matrices  for  Temperature,  Time,  Load  and  Angle 
[8]  [91  .  Our  next  step  was  to  obtain  some  estimates  of  experimental  error 
by  applying  several  techniques  that  have  been  suggested  in  the  literature  in 
recent  yearsflO]  ,  [ll]  ,  [12]  .  Among  those  used  were  Daniel's  "Half 
Normal  Plot"andthe  "Gamma  Plots"  and  ".smallest  ordered  contrasts"  by 
Wilk,  et  al. 

While  the  details  about  the  application  of  these  techniques  would  be 
informative  and  interesting,  only  the  results  are  shown  In  Table  13.  This 
table  shows  the  source  for  the  estimate,  degrees  of  freedom  (actual  or 
approximate),  s^  ano  s  values,  and  how  or  where  obtained  by  a  reference. 
Comparison  values  from  the  earlier  analysis  also  are  given.  Among  the 
problems  encountered  in  making  these  analyses  were  the  rather  large  values 
of  the  sums  of  squares  associated  with  certain  3  and  4  factor  interactions. 

No  satisfactory  explanation  has  been  found  for  such  results*. 

Returning  to  the  problem  of  improving  the  assessment  of  the  Cure 
Temperature  effects,  the  analysis  of  variance  shown  in  Table  14  was 
prepared. 

From  tnis  Table  14  it  could  be  judged  that  the  levels  of  Cure  Tempera¬ 
ture  used  in  this  experiment  did  not  affect  the  Fatigue  Life  of  the  Rubber 
Bushings.  The  presence  of  the  confounding  with  Blocks  already 


*It  is  now  clear  to  the  author  that  the  complete  design  matrix  should  have 
been  constructed  in  order  that  the  matrix  product,  X^X  (39  x  39)  could  have 
been  examined  for  the  nature  and  degree  of  confounding  present. 
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TABLE  13.  Summary  of  Estimates  of  Experimental  Error 


Source 

d.  f. 

2 

s 

s 

Referen 

Regression  Residuals 
(appears  too  large) 

15 

0,  008238 

.  0908 

(7) 

Three  and  Four  Factor 
Interactions 

13 

0. 006628 

.0812 

(7) 

Half  Normal  Plot 

30* 

0. 0067 

.  082 

(10) 

Gamma  Plot  31* 

(conservative  value  determined) 

0. 0071 

.  084 

(11) 

Smallest  Ordered 

Contrasts 

24 

0. 004736 

,  0688 

(12) 

Average  of  24  Smallest 
Ordered  Contrasts 
(optimistic,  appears 

24 

too  small) 

0. 002879 

.  0536 

(7) 

Comparison  Values 

Regression  Residuals** 

6 

0.  1275 

.  3571 

(1) 

Within  Cells** 

77 

0. 0142 

0.  1192 

(1) 

Approximate . 


**  Refer  Table  5  of  this  paper.  Multiplied  up  by  6  for  comparison  with 
the  data  above. 
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TABLE  14.  Analysis  of  Variance  for  Studying  the  Cure 
Temperature  Effect 


Source  of  Variation 

d.  f. 

S.  S. 

Mean  Square 

Blocks 

1 

0. 012272 

. 012272 

Temperature  (Linear) 

1 

0. 030044 

. 030044 

(Quad. ) 

1 

0. 000457 

. 000457 

Error  (from  Temperature  by 
Blocks  Interaction) 

2 

0.  037401 

. 018700 

Other  Effects 

17 

4. 261991 

xxxx 

(by  subtraction  from  regression  analysis) 

Remainder 

13 

0. 086165 

. 006628 

(3  and  4  factor  interactions) 


FIGURE  5.  Cure  Temperature  Effects  in  Blocks  1  and  2. 

Rubber  Bushing  Fatigue  Life  Experiment* 
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*  Points  plotted  are  anti-logs  of  average  logarithms  of  cycles  to  failure. 
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mentioned  which  enters  into  this  error  and  the  small  degrees  of  freedom 
raise  doubts  about  such  a  conclusion.  The  Cure  Temperature  results  by 
Blocks  are  shown  in  Figure  5.  This  writer's  present  opinion  is  that 
further  experimentation  is  needed  with  adequate  replication  of  the  Cure 
Temperature  levels  and  with  wider  spread,  perhaps,  3U0  to  Si  5  degrees  F. 


Evaiuation  of  Experimental  Types  of  Track  Pads.  Attention  is  directed 
to  another  component  of  the  track,  the  track  pad  of  the  Personnel  Carrier. 

We  were  asked  to  design  a  test  program  for  evaluating  14  types  of  experi¬ 
mental  composition  pads.  Two  types  of  production  pads  were  available  as 
controls.  Hence,  16  treatments  were  to  be  evaluated.  Only  one  Ml  1  3 
vehicle  would  be  available  for  carrying  out  an  accelerated  road  test  program. 
A  further  restriction  was  that  only  seven  pads  of  each  of  the  experimental 
types  could  be  made  available  for  this  program.  After  several  conferences 
with  the  interested  engineers,  the  following  resume  was  recorded  (quoted 
from  reference  [3]  ): 


"Two  objective  responses  could  be  measured  for  each  individual 
track  pad: 

(1)  Decrease  in  thickness  (due  to  wear)  of  the  pad  in  respect 
to  its  height  above  the  grouser  shoe  to  which  it  is  bolted 
(later  referred  to  as  "height  Joss"),  and 

(2)  Weight  loss  of  the  individual  pad  from  its  initial  weight. 


Both  of  these  responses  had  been  measured  in  previous  Army  tests 
with  principal  dependence  placed  on  the  weight  loss.  Other  responses 
could  be  considered  such  as  volume  loss  of  the  pad  from  its  initial 
volume  and  subjective  'scores'  or  'ratings'  based  on  chunking  and 
cracking  or  pieces  of  material  broken  off  of  the  pad  during  use. 


"A  recent  test  conducted  by  the  Food  Machinery  Corporation 
(FMC),  San  Jose,  California,  had  utilized  the  height  loss  measure¬ 
ment  for  evaluating  the  results.  Obtaining  these'measurements  had 
been  facilitated  by  the  construction  of  a  special  caliper.  The  level 
surface  of  the  grouser  lug  formed  the  reference  for  this  caliper 
which  was  really  a  type  of  'depth  gauge'.  CRDL  constructed  a 
similar  gauge  for  this  test  program. 


"In  considering  these  responses  it  was  pointed  out  that  it  should 
be  useful  to  examine  the  response  data  in  relation  to  physical  and 
chemical  properties  of  the  pad  material  compositions  for  the  various 
type  of  pads.  Examples  might  be  tensile  streng’.n,  hardness  and 
laboratory  abrasion  resistance. 
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"It  was  expected  that  total  test  driving  of  about  500  miles  would 
be  icquiicd  to  icvedl  differences  ,  if  any  existed,  among  ttie  experi¬ 
mental  type  pads.  Test  driving  would  be  terminated  or  test  pads 
would  be  replaced  if  wear  had  progressed  to  the  point  that  the  metal 
grousers  would  come  into  contact  with  the  road  surface.  Replace¬ 
ment  of  pads,  however,  could  affect  wear  of  the  pads  on  adjacent 
shoes.  Hence,  it  was  recommended  that  replacement  pads  be  pro¬ 
duction  type  pads  whose  height  above  grouser  had  been  gound  down 
or  worn  to  that  of  the  pads  on  adjacent  shoes.  Height  measurements 
of  pads  were  to  be  made;  1)  After  initial  run-in.  2)  Each  100  miles 
thereafter,  and  3)  At  termination.  It  was  suggested  also  that  initial 
and  final  weights  for  individual  pads  be  obtained  for  all  pad  types.  *" 

The  real  problem  encountered  in  setting  up  the  test  program  was 
agreement  on  the  selection  of  an  "experimemtal  unit".  Based  on  their 
"experience",  AT  AC  engineers  tended  to  favor  an  experimental  unit  or  plot 
comprising  a  cluster  of  10  consecutive  pads  of  the  same  type.  The  basis 
for  this  opinion  was  that  averaging  of  results  from  10  pads  would  provide  a 
fairly  stable  average.  The  left  and  right  sides  of  the  vehicle  seemed  to  form 
natural  blocks  for  the  experimental  design.  Obviously,  with  only  seven  pads 
on  hand  for  the  experimental  types,  this  cluster  of  10  could  not  be  obtained. 
Putting  all  seven  in  a  cluster  would  not  permit  replication. 

At  this  point,  it  was  found  that  data  from  previous  tests  conducted  by 
the  Army  at  several  sites  was  on  hand.  These  data  were  obtained  and 
analyzed  from  the  "uniformity  trial"  point  of  view**.  Cluster  sizes  of 
2,  3,  4,  6,  8,  12,  13,  and  18  were  studied  in  these  analyses  with  the 
smaller  clusters  formed  from  the  larger  clusters,  It  was  found  that  size 
of  cluster  did  not  affect  conclusions  for  any  of  the  previous  tests.  A 
peculiar  feature  of  the  Ml  13  vehicle  added  interest  to  the  problem  of  deter¬ 
mining  the  cluster  size;  one  side  of  the  vehicle  has  64  track  shoes  and  the 
other  side  has  63. 

With  the  uniformity  analyses  information  available,  a  cluster  size  of  4 
was  established  on  one  side  so  that  16  x  4  ~  64  and  on  the  other  side  a  cluster 
size  of  3  was  used  with  16  x  3  =  48.  The  remaining  15  pads  on  this  side  were 
filled  in  with  standard  pads  and  limited  supplies  of  a  few  other  experimental 
pads.  Hence,  the  experimental  design  may  be  described  as  a  Randomized 

^Unfortunately,  the  14  experimental  types  of  pads  were  weighed  in  groups 
before  installation  so  that  data  from  this  experiment  do  not  provide  suffi¬ 
cient  information  for  correlation  of  height  loss  and  weight  loss. 

**A  more  sophisticated  approach  would  have  calculated  auto-correlations 
of  weight  losses  for  adjacent  pads  and  pads  separated  by  1 ,  2,  3  or  more  up 
to  K-2  pads,  where  K  was  the  number  used  in  a  cluster. 


Complete  Block  for  16  treatments  in  two  replicates  with  each  track  of  the 
vehicle  forming  a  block.  While  normal  driving  provides  a  natural  randomi¬ 
zation  on  thp  wnav  nf  flip  pads,  a  different  randomisation  was  used  for  the 
treatments  on  each  track. 

No  difficulties  were  experienced  in  carrying  out  the  500  mile  accelerated 
test  program.  There  were  some  dcubts  in  my  mind  about  the  scheduled  500 
miles  being  sufficient  to  show  up  differences  among  the  treatments  since 
other  Army  tests  had  comprised  total  mileages  of  1000,  1500,  or  2000  miles. 
The  program  could  not  be  extended  for  this  test,  however,  because  the 
vehicle  had  to  be  returned  to  another  agency. 

With  respect  to  analysis,  we  followed  the  suggestions  of  George  Box 

(1950)  on  analysis  of  growth  and  wear  curves  [  1 4 ]  .  Differences  between 

successive  measurements  of  pad  height  above  grouser  were  formed,  e.  g.  , 

H  -  H  for  i  =  1  through  5.  These  differences  appeared  to  be  reasonably 
i  i- 1 

distributed  so  the  analysis  of  variance  was  applied  directly  to  these  differ¬ 
ences  without  transformation.  In  order  to  help  understand  the  analysis  of 
variance  (Table  15),  Figure  6  explains  the  structural  arrangement  of  these 
differences. 

It  will  be  noted  that  Tabie  15  shows  only  14  degrees  of  freedom  for 
treatments  (Types  of  Pads);  this  happened  because  only  one  Control  Type 
was  available  when  the  driving  program  was  started.  This  one  control  Type 
was  duplicated  on  each  track,  To  simplify  the  computer  programming,  only 
one  cluster  in  each  block  was  used  for  the  Control  Type  of  Pad.  A  list  of 
means  for  the  Pad  Types  in  each  block  and  overall  is  given  in  Table  16. 

Now  what  about  interpretation?  Statistically,  I  was  quite  pleased  with 
these  results.  We  used  Multiple  Comparisons  Procedures  to  group  the 
experimental  pad  types  into  significantly  different  groups  [l9]  .  Our  next 
step  was  to  try  to  relate  the  values  of  these  means  to  other  available  physical 
and  chemical  data  on  the  experimental  Types  of  Pads.  Unfortunately,  no 
significant  regressions  could  be  obtained.  Hence,  it  is  my  personal  opinion 
that  there  is  still  room  for  a  lot  of  research  on  military  track  pads  in  order 
that  we  can  find  the  determinants  of  lor.ger  life  for  this  element  of  the  vehicle 
track. 

Reliability  Analysis  for  Track  Components.  As  a  final  example  in  this 
paper  I  shall  present  briefly  some  attempts  at  reliability  evaluation.  ' 
Recently,  the  Army  has  conducted  some  "road  Testing"  of  a  new  track 
design  for  the  tank.  This  new  design  comprised  a  track  made  up  with  track 
shoes  whose  grouser  shape  was  formed  by  replaceable  pads.  Road  testing 
was  conducted  at  three  sites  using  three  vehicles  at  one  site  and  two  each 
at  the  other  sites,  or  a  total  of  seven  vehicles.  Total  distance  driven 
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FIGURE  6.  Structural  Arrangement  of  Differences  for  Analysis 
of  Height  Loss  of  Track  Pads 


1.  Block  =  Side  of  Vehicle 

2.  Treatments  (16)  randomized  over  plots  in  each  block, 

3.  Plot  =  Cluster  of  3  or  4  consecutive  pads  (3  on  left  side; 

4  on  right  side). 

4.  Split-Plot  =  Unit  of  travel  (100  miles)  (labeled  as  Period 

in  Analysis  of  Variance). 

5.  Individual  Pad  =  Subsampling  unit  within  the  split-plot. 

6.  Height  of  Pad  recorded  at  0,  100,  200,  300,  400,  500  miles 

7.  Differences  taken  for  each  pad  for  each  increment  of  wear 

(100  miles)  giving  a  total  of  7  x  16  x  5  =  560  differences. 
Difference  =  Height  Loss. 
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TABLE  15.  Analysis  ot  Variance  of  Track  Pad  Test  Results;  15  Pad 
Types  Mf  mted  on  Both  Tracks  of  Ml  13  Personnel  Carrier 

with  T  ,nce  Uoaenrorl  fn  ■*.  7  ~ -f  HP _ _  _ 

O  -  - - - - -  *  *  - - -  A  ^  VVt^J. 

5  Periods  of  100  Miles  Each* 

Source  of 

Variation 

Degrees  of 

F  reedom 

Sum  of 
Squares* * 

Mean  F** 

Square  Ratios 

Uncorrected  Total 

525 

299566 

— —  . 

Correction  Term  for 
Overall  Mean 

1 

209121 

209121  - 

Sides  of  Vehicle 

1 

392 

392 

Types  of  Pads 

14 

50696 

3621  ~91 

Error  (a) 

14 

555 

39.  6 

Periods 

4 

7655 

1914  ~2  3 

Types  x  Periods 

56 

7681 

137  ~  1.7 

Periods  x  Sides 

4 

250 

62 

Types  x  Periods  x  Sides 

56 

4675 

83 

Error  (b)’1'** 

60 

4925 

82.  1 

Pads  Within  Types 

Left  Side 

30 

2406 

80 

Right  Side 

45 

2203 

49 

Pads  Within  Types  x  Periods 

Left  Side 

120 

9588 

80 

Right  Side 

180 

4344 

24 

*  Variable,  analyzed  is 

Height  Loss  for 

a  single  Period  of  each  individual 

pad  within  a  Type,  i.  e,  ,  7  pads  for  each  Type  x  15  Types  x  5  Periods 
gives  525  measurements  of  Height  Loss.  Units  are  the  same  as  in 
Table  16,  but  squared  here. 

’'  "'Addition  may  not  check  in  this  column  because  of  rounding  to  whole 
numbers  in  sums  of  squares  for  each  source  of  variation.  » 

. ’"E:rror  (b)  is  sum  of  two  preceding  sources  which  appear  to  be  homogeneous. 

.  . . .  ratios  are  computed  using  Error  (a)  for  Sides  and  Types,  and  Error  (b) 

for  Periods  and  Types  x  Periods.  This  procedure  conforms  to  the  split- 
plot  structure  of  the  experimental  plan  with  Periods  considered  as  the 
split-plot  treatments. 
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TABLE  16.  Average  Height  Losses  of  Track  Pad  Types  for  500  Mile 

Test  Propram*  Rased  nn  3  Pads  nn  Left  Ride  and  4  Padc  nn 
Right  Side.  Averages  are  in  Thousandths  of  an  Inch: 


14. 63  =  . 01463" 

Type  No. 

CRDL  Code 

Left  Side 

Right  Side 

Ove  rail 

Rank* 

1 

S131C2F2 

16.  33 

13.  35 

14.  63 

6 

2 

Z 1 38 

17.  67 

17.  75 

17.71 

11 

3 

Z138C 

13.  87 

13.  60 

13.71 

3 

4 

Z138C1F 

14.  60 

16.  95 

15.  94 

8 

5 

Z138CF1 

14.  53 

11. 00 

12.  51 

2 

6 

Z138F2 

16.  07 

12.  55 

14.  06 

4 

7 

Z 1 2  IF 

30.  33 

28.  10 

29.  06 

17 

8 

Z140 

20.  13 

18.  35 

19.  U 

12 

9 

S131C2F22 

52.  53 

46.40 

49.  03 

19 

10 

Z138C2 

15.  87 

14.  05 

14.83 

7 

11 

Z1  38C3DF3 

16.  93 

16.  80 

16.  86 

10 

12 

Z116CF2 

35.87 

33,  25 

34.  37 

18 

13 

SI  31C2F2BD 

16.73 

15.  65 

16.  11 

9 

14 

Z128CF 

12.  93 

9.  35 

10.  89 

1 

15 

Comm'l  SBR 

19.93 

21. 00 

20.  54 

14 

16 

A- 0 

13.  60  (2)** 

*  15.20(2)’!"!": 

’  14.40 

5 

17 

C-0 

21. 80  (4) 

17.  90  (2) 

20.  50 

13 

18 

C-10 

25.  12  (5) 

— 

25.  12 

16 

15L 

Comm'l  SBR!" 

!<*  20.  71  (7) 

— 

20.  71 

15 

’'Averages  are  calculated  on  a  per  Period  basis.  Multiplication  by  5 
gives  Average  Total  Height  Loss.  Standard  deviation  of  a  Type  Average 
[Error  (a)/(35)]  ^  =  (39.  6/35)*'^  =  1. 063  units  (refer  Table  15). 

,;”'Rank  is  in  order  from  lowest  to  highest  Height  Loss. 

**>:> Numbers  of  Pads  averaged  in  last  four  rows. 

>:< >;< »:< These  added  seven  pads  for  Comm'l  SBR  were  omitted  from  Type  15 
in  the  variance  analysis  to  simplify  the  programming  and  weighting  of 
data  problems.  Comm'l  SBR  is  designated  as  the  principal  control. 
Type  C-10  is  considered  a  secondary  control. 
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TABL  17,  Order  Statistics  for  Pad  Set  Miles  with  Some 
Summary  Statistics 


Line  No.  Statistic  No. 


Miles  Completed 


Summary  Statistics 


1 

389 

2 

904 

3 

939 

4 

1 

1667 

5 

2 

1994 

6 

3 

2096 

7 

4 

2250 

8 

5 

2250 

9 

6 

2300 

10 

7 

2338 

11 

8 

2430 

12 

9 

2496 

13 

10 

2570 

14 

11 

2628 

15 

12 

2677 

16 

13 

2706 

17 

14 

2750 

18 

n  =  1  5 

2813 

Total 

35,965 

First  three  values  not  counted 
Mj,  smallest  value 
Q^,  first  quartile 

Median 

Q^,  third  quartile 

M^,  largest  value 
Mean  about  2400  miles 


•f 
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TABLE  18.  Reliability  Estimation  for  Pad  Sets  Under  the 
Test  Conditions  of  the  Program 

Lower  Confidence 
Estimate  for  Limit 


m  =  2000  mile  s 

0 

"(y  =  0.  90) 

A. 

Binomial 

0.  933 

0.  764 

B. 

Johns  and  Lieberman 

0.  914 

0.  811 

M  =  0 

o 

C. 

Three  parameter 

0.  955 

0.881 

approximation 

M  a  1667 
o 
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exceeded  35,000  miles.  In  covering  this  distance,  complete  pad  sets  were 
replaced  wrien  worn  u  uL ,  aiiu  aumc  iiiu.tviu  uetl  Liack.  shucb  were  rcpiaccu 
although  no  complete  track  set  was  replaced  or  judged  completely  worn  out. 

After  much  thought  about  the  problem,  it  appeared  to  us  that  a 
reliability  statement  might  be  made  about  the  pad  sets  and  for  the  first 
track  shoe  replacement  on  each  vehicle.  Table  17  shows  the  order  statistics 
data  for  the  18  pad  sets  used  [15]  .  Three  short  mileages  were  omitted 
from  our  analysis  for  obvious  reasons  and  one  value  of  1994  miles  was 
counted  as  a  "success"  in  attaining  2,000  miles.  It  is  to  be  noted  that  our 
reliability  estimates  apply  to  the  conditions  of  the  road  test  and  not  to  Army 
use  in  general.  In  Tabic  18  the  results  are  shown  for  three  approaches  to 
the  problem  [15]  *.  Johns  and  Lieberman  refer  to  their  recent 
T echnometric s  paper  [16]  .  The  binomial  result  is  for  1 4/ 1  5  =  0.  933  and 
use  of  a  binomial  table  [17]  ,  The  third  result  is  a  crude  approximation 
that  I  obtained  from  the  Johns  and  Lieberman  approach. 


Information  about  the  first  track  shoe  replacements  is  given  in  Table  19 
[15]  **.  Again,  an  observation  has  been  omitted  in  the  analysis. 


TABLE  19.  Mileage  at  Replacement  of  First  Track  Shoe  During 
Road-Testing  of  New  Track  Design  of  Seven  Vehicles 


Vehicle 

Number 

1 

2 

3 
1 

4 

5 

6 
7 

Median  =  3315; 


Replacement 

Mileage 

904*** 
274  5 
2992 
3000 
3315 
3686 
392  5 
4894 


Number  of 
Shoes  Replaced 

30*** 

1 

2 

4 

1 

% . . .  ' 

2 

1 

1 


Average  =  3508 


’’■■These  Tables  17  and  18  are  based  on  Tables  4.  1  and  4.  3  of  [15]  . 

**This  Table  19  is  derived  from  Table  6.  2  of  [  1 5 ]  . 

*** Datum,  not  used  because  entire  track  was  thrown;  damaged  shoes  were 
replaced. 


/ 
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Table  19  shows  the  mileages  arranged  as  order  statistics;  vehicle 
numbers  are  arbitrary  designations.  From  the  lowest  value,  2745  miles, 
and  the  sample  size,  n  =  Y,  we  may  estimate  with  50%  confidence  that  907o 
of  vehicles  road-tested  under  similar  accelerated  conditions  will  have  their 
first  track  shoe  replacements  after  2745  miles.  This  result  is  a  non- 
parametric  tolerance  limit  [18]  .  If  a  higher  confidence  statement  is 
desired,  then  the  tolerance  proportion  or  reliability  stated  must  be  lowered. 
For  90%  confidence,  the  figure  becomes  72%  first  track  shoe  replacements 
after  2745  miles,  which  is  a  one-sided  binomial  limit  [  1 7 ]  .  One  would 
like  to  apply  the  Johns  and  Lieberman  technique  to  these  track  shoe  date 
but  the  smallest  sample  size  for  which  they  have  worked  out  their  tables 
is  n  =  10  [16]  . 


REFERENCES 


1.  C.  D.  Rose,  !" laboratory  Investigation  on  Fatigue  Life  of  Rubber 
Bushing  Track  Pin  Assemblies ,  "  Report  No.  7908,  USOTAC, 

16  July  1963.  {Supplement  on  "Analysis  Procedure"  by  Emil  H,  Jebe.) 

2.  M.  Zelen,  "Factorial  Experiments  in  Life  Testing,"  Technometrics  1, 

269  (1959).  Also,  see  Zelen  and  Dannemiller,  "The  Robustness  of  Life 
Testing  Procedures.  same  journal,  3,  29  (1961). 

3.  D.  S.  Robson,  "A  Simple  Method  for  Constructing  Orthogonal 
Polynomials  when  the  Independent  Variable  is  Unequally  Spaced,  " 
Biometrics  15,  187  (1959). 

4.  H.  Scheffe,  "Experiments  with  Mixtures,"  Jour.  Royal  Stat.  Soc.  , 

Series  B,  20,  344  (1958).  -  - 

5.  H,  Scheffe,  "The  Simplex-Centroid  Design  for  Experiments  with 
Mixtures,  "  Jour.  Royal  Stat.  Soc.  ,  Series  B,  25,  235  (1963). 

6.  O.  Kempthorne,  "Design  and  Analysis  of  Experiments.  "  J,  Wiley  and 
Sons,  New  York,  (1952). 

7.  E.H.  Jebe  and  R.  A.  King,  "Investigation  of  Cure  Time  and  Cure 

T emperature  Eifects  on  Fatigue  Life  of  Rubber  Bushings  for  Track 
Link  Pins,  "  Report  prepared  for  Projects  05895  and  07312,  Willow 
Run  Laboratories,  The  University  of  Michigan  under  Contract 
DA-20-0 18 -AMC-0954T  with  USATAC. 

8.  P.  R.  Halmos ,  "Finite  Dimensional  Vector  Spaces,  "  Princeton 
University  Press,  Princeton,  6th  Printing  ( 1953)  (see  Appendix  II  for 
discussion  of  the  Kronecker  or  Direct  Product,  pp.  170-182). 


89 


9.  E.  H.  Jebe  and  R,  O,  Bennett,  Jr.  ,  "The  MATRIX  MACROS  Program,  " 
(in  process).  This  set  of  macros  or  subroutines  includes  two  special 
Kronecker  Product  routines. 

10.  C.  Daniel,  "Use  of  Half  Normal  Plots  in  Interpreting  Factorial  Two- 
Level  Experiments,  "  Technomctrica  1,  311  (1959). 

11.  M.  B.  Wilk,  et  al,  "Probability  Plots  for  the  Gamma  Distribution,  " 
Technometrics  4,  1  (1962). 

12.  M.  B.  Wilk,  et  al,  "Estimation  of  Error  Variance  from  Smallest 
Ordered  Contrasts,  "  Journ.  Amer.  Stat.  Assn,  ,  58,  152  (1963), 

13.  E.  H.  Jebe  and  R.  A.  King  and  J,  W.  Curtis,  "T-130  Track  Pad  Test 
Program:  Experimental  Design  and  Analysis  of  Test  Results,  " 

Informal  report  prepared  under  Projects  05895  and  07  312,  Willow  Run 
Laboratories,  The  University  of  Michigan  under  Contract  No.  DA-20- 
11  3- AMC -0592  7(T). 

14.  G.  E,  P.  Box,  "Problems  in  the  Analysis  of  Growth  and  Wear  Curves,  " 
Biometrics  6,  362  (1950). 

15.  Report  to:  OCRDL,  USATAC,  Subject:  "Preliminary  Report  on 
Examination  of  Data  from  the  Track  Improvement  Program,  "  by 
E.  H.  Jebe,  J.  W.  Curtis  and  R,  A.  King,  28  June  1966,  Prepared 
under  Project  07312,  Willow  Run  Laboratories,  The  University  of 
Michigan  under  Contract  No.  DA-20- 1 1  3-AMC -05927(T). 

16.  M.  V,  Johns  and  G.  J.  Lieberman,  "An  Exact  Asymptotically  Efficient 
Confidence  Bound  for  Reliability  in  the  Case  of  the  Weibull  Distribution,  " 
Technometrics,  8,  1  35  (1966). 

17.  J.  R.  Cooke,  M,  T.  Lee  and  J.  P.  Vanderbeck,  "Binomial  Reliability 
Table  (LCL)  for  the  Binomial  Distribution,  "  NAVWEPS  Report  No,  8090, 
NOTS  TP  3140,  China  Lake,  California,  Jan.  1964. 

18.  D.  B.  Owens,  "Handbook  of  Statistical  Tables,  "  Addison-Wesley, 
Reading,  Mass.  (1962). 

19.  G.  W.  Snedecor,  "Statistical  Methods,"  Ames,  Iowa,  ISU  Press  (1956) 
Refer  section  10.6,  p.  251. 


A  STATISTICAL  ANALYSIS  OF  PROVISIONING 
PROCESSES  ON  FOUR  ARMY  MISSILE  SYSTEMS 


Robert  G.  Provost 

U.  S.  Army  Missile  Command,  AMSMI-WK 
Redstone  Arsenal,  Alaoama 


[The  author  presented  a  series  of  slides  at  the  conference.  These 
slides,  with  the  information  about  each,  are  reproduced  ir  this 
article .  ] 

SLIDE  1  -  Title  slide 

2  -  Schematic  of  PDS  routes 
i  -  Station  ident 

4  -  Matrix 

5  -  Route  ident 

6  -  Bottom  of  PDS 

7  -  Blow-up  of  matrix  -  1  cell  and  title  blocks 

8  -  F  igurc  of  head! 

9  -  Tukey  -  Dixon  -  Snedecor 

10  -  Matrix  w/avg  station  lengths 
1  1  -  400  PDS  sample 
12  -  Axe  head  cutting  time  in  half 
1  i  -  The  end 
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stage  to  the  production  stage,  changes  will  occur  as  a  result  of  value 
engineering  applications,  changes  in  technology,  improved  materials  and 
hardware  items,  preproduction  engineering,  and  the  discovery  of 
inadvertent  errors.  These  changes  are  incorporated  through  the  use  of 
and  Engineering  Order  (an  EO).  In  one  stage  of  the  EO,  a  Provisioning 
List  (PL)  is  generated  which  subsequently  ends  up  in  our  Supply  and 
Maintenance  Directorate  as  a  PDS  (Provisioning  Data  Sheet).  A  Pro¬ 
visioning  List  contains  all  the  parts  needed  to  support  the  change,  whereas 
a  PDS  is  a  computer-produced  sheet  for  each  part  listed  on  the  PL.  It 
is  used  as  a  worksheet  to  identify  that  part  within  the  Federal  Cataloging 
System. 

Due  to  the  different  types  and  classes  of  parts,  and  to  the  priority 
required,  a  PDS  will  flow  through  this  portion  of  the  Sly  M  Directorate  along 
different  routes.  Also  due  to  the  lack  of  different  kinds  of  information ,  a 
PDS  will  take  still  other  routes. 

Current  regulations  provided  a  90-day  time  limit  to  process  a  PDS 
through  the  S&  M  Directorate.  PDS's  which  exceeded  this  time  limit  were 
considered  delinquent. 

Since  a  PDS  represents  a  single  line  item  in  a  PL  and  a  PL  could 
contain  from  one  to  1000  or  more  line  items,  any  PDS  which  exceeded  the 
time  limit  caused  the  entire  PL  to  become  delinquent.  Management  was 
concerned  about  these  delinquencies  and  wanted  to  know,  since  each  PDS 
flowed  through  various  routes  and  stations,  what  the  average  length  of  each 
station  and  route  actuaily  was,  and  could  the  90-day  time  limit  be  reduced. 
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determination  of  the  various  routes.  This  slide  is  a  schematic  of  these 
routes.  Six  basic  stations  were  identified:  station  number  1  through 
station  number  6.  Each  station  performs  one  or  more  functions  during 
the  flow  cycle  of  the  PDS.  These  are  identified  by  the  alpha  characters 
after  each  station  number.  For  example,  Station  4  has  but  a  single  func¬ 
tion  whereas  Station  2E  indicates  that  this  is  the  fifth  function  performed 
by  that  station.  By  counting  all  possible  combinations  of  routes  in  this 
schematic,  one  can  easily  determine  that  there  are  fifteen  different  routes 
for  an  ADP  initiation,  and  the  same  number  of  routes  for  a  local  initia¬ 
tion,  This  is  true  for  a.  single  type  of  a  PDS,  but  due  to  priorities,  there 
are  three  types  of  PDS's  to  consider.  These  are  coLored  lor  easy 
identification:  a  white  PDS  for  routine  or  low  priority  items,  a  yellow 
PDS  for  high  priority  items,  and  a  green  PDS  for  emergency  items. 

Adding  those  various  types  of  PDS's  into  the  schematic  ,  a  maximum  of 
forty-five  different  routes  is  now  possible  for  each  initiation.  To  compli¬ 
cate  matters  still  further,  a  green  PDS,  used  for  emergencies,  is  also 
used  as  a  delinquency  flag.  Should  any  PDS  remain  in  this  portion  of  the 
processing  cycle  beyond  a  specified  period  of  time,  a  delinquent  green 
PDS  is  initiated  locally.  This  is  rushed  through  the  system  until  it  reaches 
that  station  in  which  the  original  PDS  is  bogged  down.  Since  this  delin¬ 
quent  green  PDS  can  travel  along  any  route,  our  total  maximum  number  of 
possible  routes  now  stands  at  sixty.  Add  to  these  routes  the  fact  that 
occasionally  a  yellow  PDS,  during  the  processing  cycle,  can  be  downgraded 
to  a  lower  priority,  that  is,  downgraded  to  the  status  of  a  white  PDS.  The 
processors  when  confronted  with  this  action  would  hand  stamp  the  yellow 
PDS  not  with  a  "downgrade"  stamp,  but  with  one  called  "PEPSODENT"  -- 
you  wonder  where  the  yellow  went! 
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STATION  IDENTIFICATIONS  &  FUNCTIONS 


CL 

<£ 

>- 

Cl 

_ J 

Cl 

C_> 

CL 

on 

l — 

X 

z 

on 

h- 

o  us 

on 

F-  *-<  ca 

z 

z 

X 

z  on  z 

or 

Lu  • 

CD  LU 

F- 

i— 1  on  <t 

i 

Oj 

=3  O 

on 

<C 

OO 

Z  UJ 

to 

z 

on  on 

on 

O  -J 

US 

lu 

o  »on  • 

a: 

1 — 1  l — « 

o  z 

«c 

Q.  <£  UJ  X 

on  u. 

y 

us 

F—  Z  uj 

oa 

X 

<  on 

«c 

on  <.  Lu  F— 

_J  DC 

uj  Q  F—  on 

UJ 

UJ  UJ 

on  x 

F-  UJ  >- 

X  uj  ~J  on 

UJ 

Z  F- 

LU  LU 

o 

ac 

t— .  on 

ac  ac 

UJ 

££§:x 

x 

<c 

<t  =3 

on 

X 

o 

DC  X 

a.  CL- 

Z 

<_) 

on 

O 

uj 

O 

Ll  on 

CL  O. 

* — t 

*£ 

to  UJ  u_ 

CL 

LU 

CL  =3 

F— 

—  F-  oc 

UJ 

on  F- 

UJ 

on  >~ 

o  z  o  on 

a. 

O  < 

» to 

Z 

DC  t_J 

mL  Ll 

O 

CL  C2 

on  (— 

X 

UJ  «f 
CD  £ 

<  >■ 

DC 

Cl 

UJ  LU 

Ll 

•  x  on  o 

CL 

X  x 

•—  to 

Son  UJ  LU  X 

uj  *-•  oc  o 

i —  =»  or 

zu.ua  u_ 
onaz 
uet  «t  z 

uj  »  on 

on  X  »  u. 

z  F—  »■ 

—>  **£.  <_»  on 

C  X  O  <  Z 

F—  uj  £3C  lu 

z  h  ^  a  < 

“  nu  u  F— 

<  ><  u  CD 

X  on  q.  <  o 


=3  O 
f—  as 
m  <  in 
F- 

>»  on  ac 
-J I  £  «C 
Cl  O  cu 
CL  ►- • 
3|-D 
on  m  uj 

on  oc 
to 

UJ  X  X 

sco-a 


un 

_J 

z 

o 

z 

L™. 

on 

CL 

* 

o 

US 

o 

z 

F— 

«— 1 

_J 

■» 

o 

o 

z 

us 

V- 

US 

-J 

Lu 

O 

c 

LU 

z 

< 

F- 

UJ 

L— 4 

o 

DC 

UJ 

F- 

< 

US 

M 

* — 

< 

UJ 

>- 

o 

Lu 

on 

t— 

LL 

_l 

-J 

LJ 

X 

ss 

—J 

CL 

< 

F— 

a 

*-* 

a 

CD 

Ol 

F- 

Z 

z 

< 

Lu 

X 

<c 

g 

UM 

s: 

U. 

CL 

on 

c 

96 


SLIDE  3.  Station  ident.  That  Pepsodent  action  was  generally  per¬ 
formed  in  Station  2.  Perhaps  at  this  point  we  should  examine  the  various 
functions  of  each  station  TMc  elide  depicts  each  of  the  stations 
identifies  their  function(s). 

Station  l,  our  Industrial  station,  is  actually  located  in  the  Procure¬ 
ment  and  Production  Directorate.  This  station  performs  the  validation  of 
each  part  number  to  facilitate  the  finding  of  the  proper  FSN  for  that  part. 
This  station  also  checks  the  part  against  the  drawings  for  accuracy,  and, 
as  need  be,  obtains  new  drawings  as  required. 

Station  2,  Maintenance  Engineering,  is  the  control  station  for  this 
procedure.  Upon  receipt  from  the  computer  they  review  each  PDS  for 
completeness  and  accuracy,  distribute  them  into  system,  verify  mainte¬ 
nance  data  and  assign  pack  data  as  required,  and  after  ail  the  work  has 
been  accomplished  remove  each  completed  PDS  from  system.  They  also 
prepare  the  delinquent  green  PDS,  whenever  any  PDS  is  not  removed  from 
the  system  on  time. 

Station  3  is  Federal  Cataloging.  J.t  is  this  station  which  obtains 
the  FSN  for  each  PDS  from  proper  sources,  either  locally  or  from  outside 
agencies,  and  assigns  this  Federal  Stock  Number  to  each  part  on  receipt. 

Station  4,  the  Publications  station,  extracts  the  pertinent  data 
from  each  PDS  for  inclusion  in  Supply  and  Technical  Manuals.  They  also 
update  the  master  files. 

Station  5,  our  Supply  Control  station,  which  makes  the  necessary 
supply  studies,  prepares  and  submits  requisitions  and  sets  up  the  purchas¬ 
ing  of  required  parts. 

Station  6,  Cataloging.  This  function  of  cataloging  involves  the 
advance  notification  to  our  supply  depots  of  these  various  parts  that  are 
coming  through  our  system. 

It  is  at  this  point  I  should  mention  that  all  PDS's  do  not  lack  an  FSN. 
Some  PDS's  do  not  require  an  FSN  since,  for  example,  the  part  is 
fabricated  or  modified  in  place.  Other  PDS's,  the  bulk  in  fact,  had  the 
proper  FSN  located  by  the  computer  when  the  PL  was  converted  into  the 
various  types  of  PDS's. 


MATRIX  FOR  PDS  SAMPLES 


SLIDE  4.  Matrix.  Returning  to  our  sixty  possible  routes  for  each 
initiation,  each  type  PDS  was  carefully  examined  in  relation  to  all  of  its 
possible  routes  through  this  portion  of  the  S&  M  Directorate  and  instead 
of  some  sixty  possible  routes,  a  total  of  twelve  basic  routes  emerged. 

Four  of  these  basic  routes  were  eliminated  for  reasons  such  as:  the 
item  was  fabricated,  not  purchased;  infrequent  use,  like  once  in  two 
years;  and,  sundry  other  reasons  leaving  us  with  eight  basic  routes. 

These  routes  divided  into  pairs  of  routes,  with  each  pair  having  one 
broadcasting  function;  Either  the  PDS  was  broadcasted  (because  it 
represented  a  MICOM-managed  item),  or  it  was  not  broadcasted.  These 
routes  were  further  subdivided,  by  segregating  the  three  colors  of  PDS's 
into  sub-pairs'of  routes  within  the  basic  routes.  An  adaitiona.1  pair  of 
routes  was  developed  after  discussion  with  the  personnel  of  one  station 
because  either  the  PDS  could  be  handled  in  a  relatively  short  period  of 
time  (30  days  or  less)  or  an  extremely  long  period  of  time  (90  days  or 
more).  Since  we  knew  the  reason  for  this  long  period  of  time  the  data 
collected  for  this  pair  of  routes  were  subsequently  omitted.  This  slide 
shows  the  final  configuration  as  well  as  the  matrix  developed  to  handle 
this  problem.  The  columns  which  identify  the  stations  are  coded  Aj 
through  An  for  subsequent  use  in  a  computer  program.  The  rows  which 
identify  the  routes  are  coded  Bj  through  B14  for  the  same  purpose.  Routes 
Bq  and  Bjq,  which  represent  the  long  cycle  time  of  one  station,  were 
subsequently  dropped  for  the  reason  stated  before.  The  intersection  of 
a  row  and  a  column  is  designated  as  a  cell.  Each  cell  is  divided  into  four 
columns,  one  column  for  each  missile  system  under  consideration.  The 
shaded  areas  represent  those  stations  which  are  not  in  that  specific  route. 
Thus  Route  Bj  only  contains  four  stations,  A| ,  Aq,  A^q,  and  Aj^ ,  whereas 
Route  contains  all  eleven  stations,  Aj  through  All* 
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Cff£EV  PDS—~EKERGEMCT  Off  DELINQUFirfl 


SLIDE  5 


Route  ident . 


This  slide  identities  each  pair  of  routes. 


Route  s 

Route  s 

Routes 

Routes 

validation. 


13^  and  are  for  white  PDS's  which  have  an  FSN. 

B,  and  B,  are  for  white  PDS's  which  do  not  have  an  FSN. 

3  4 

B  and  B,  are  for  green  PDS's  which  do  not  have  an  FSN. 

5  o 

and  are  for  white  PDS's  without  an  FSN  and  require 
This  validation  is  accomplished  in  30  days  or  less. 


Routes  Bg  and  arc  also  for  white  PDS's  without  an  FSN  which 

require  validation  but  this  validation  required  90  days  or  more  to 
accomplish.  For  the  reason  mentioned  before,  these  routes  were  removed 
from  the  analysis. 


Route  B^  and  B^  are  for  green  PDS's  without  an  FSN.  These  differ 

from  routes  Bg  and  in  that  the  FSN  was  not  immediately  located  and 
requires  outside  agencies  assistance. 

Routes  Bj  j  and  are  for  yellow  PDS's  without  an  FSN. 

The  difference  between  each  pair  of  routes  is  that  the  first  route  of 
a  pair  contains  a  broadcasting  function. 


Are  there  any  questions  up  to  this  point  ? 
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SLIDE  6.  Bottom  of  PDS.  This  slide  depicts  the  lower  portion  of 
a  yellow  PDS.  As  you  can  see,  there  are  sections  for  each  station  to 
record  its  completion  date.  When  the  PDS  emerges  from  the  computer, 
it  is  signed  off  by  the  computer  at  location  A.  All  PDS's  that  lack  an 
FSN  or  has  questionable  FSN's  are  then  sent  to  Station  2A  where,  after 
screening  for  initial  dissemination  and  other  actions  which  are  dependent 
upon  which  missile  system  is  involved.  Station  2A  records  its  comple¬ 
tion  date  at  location  B.  Each  PDS  is  then  sent  to  the  next  station  as 
determined  by  Station  2.  In  a  similar  fashion  each  station  records  its 
completion  date  at  its  appropriate  place  on  the  PDS.  The  elapsed  time 
between  successive  dates  is  indicative  of  the  amount  of  time  that  a  PDS 
remained  in  that  station  including  the  transportation  time  to  that  station. 
Since  this  transportation  time  is  essentially  the  same  for  all  stations, 
no  effort  was  made  to  remove  this  small  amount  of  time  involved.  The 
last  station  to  handle  a  PDS  is  Station  2E  which  removes  the  PDS  from 
the  system.  The  PDS's  thus  removed  are  filed,  by  missile  system,  in 
order  of  their  removal.  Since  the  elapsed  time  varies  greatly  from  the 
initiation  of  a  PDS  to  its  completion  date,  the  stack  of  completed  PDS's 
in  each  missile  system  could  be  considered  to  be  in  a  random  sequence. 
However,  to  preclude  any  possibility  of  bias,  when  these  stacks  of  PDS's 
were  sampled,  the  PDS's  were  randomly  selected. 


Best  Available  Copy 
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CELL  CONFIGURATION 


SLIDE  7.  Bicw-up  of  matrix  -  1  cell  and  title  blocks.  A  worksheet 
was  developed  to  record  the  completion  date(s)  of  each  station  for  each 
PDS  selected  for  each  route  previously  identified.  These  samples  were 
replicated  four  times  for  each  missile  system.  After  the  range  of  these 
completion  dates  were  established  through  inspection,  a  pseudo-Juiian 
calendar  (one  which  omitted  all  Saturdays,  Sundays,  and  holidays)  was 
developed  to  permit  the  transfer  of  each  recorded  date  into  the  pseudo- 
Julian  date.  Subtraction  of  these  converted  sign-off  dates  indicated  the 
elapsed  number  of  days  that  each  PDS  remained  in  each  station.  This 
procedure  provided  a  maximum  of  56  measurements  per  missile  per 
station,  with  a  total  of  more  than  1600  measurements  taken  to  fill  the 
matrix.  From  the  slide  one  can  observe  how  each  cell  was  filled  with 
these  real  time  measurements. 

The  first  attempt  to  analyze  this  recorded  data  was  made  through  the 
use  of  an  analysis  of  variance  program,  which  was  borrowed  from  the 
UCLA  Medical  Center,  for  two  main  determinations;  (i)  to  determine  if 
there  were  significant  differences  between  each  pair  of  routes  (by  omitting 
the  broadcast  function  -  Station  6),  and  (2)  to  determine  if  there  were 
significant  differences  between  missiles  as  well  as  colors.  The  results 
from  the  analysis  of  variance  program  run  were  tested  after  proper 
conversions  against  the  "F"  test  for  significance  at  the  95%  and  99% 
confidence  limits.  All  were  essentially  negative,  which  subsequently, 
permitted  the  combination  of  measurements  for  larger  samples.  Unfor¬ 
tunately,  some  difficulty  was  experienced  during  the  computer  run  of  this 
program  (conflicting  statements  in  the  program  and  a  faulty  printer)  which 
delayed  the  computation  of  the  analysis  of  variance  for  these  data. 
Furthermore,  this  particular  analysis  of  variance  program  was  incomplete 
in  that  it  was  not  programmed  to  compute  nor  print  out  the  mean,  the 
variance,  and  the  standard  deviation  for  each  row,  column,  sum  of  rows, 
and  sum  of  columns  nor  the  required  two-way  tables  for  analyzing  signifi¬ 
cant  differences.  During  this  delay,  utilizing  the  original  matrix,  the 
mean,  variance,  and  standard  deviation,  at  95%  confidence  limits,  were 
hand-computed  for  each  missile,  for  each  station,  and  for  each  route 
(each  route  having  been  identified  for  a  single  color)  as  well  as  for  the 
combined  group  of  missiles.  All  of  these  resulted  in  extremely  large 
standard  deviations. 


Bost  Available  Copy 
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SLIDE  8.  Figure  of  head!  This  is  how  I  felt!  I  needed  help!  So, 
at  this  point,  I  k-unLo.1.  leu  Dr.  Harsnoargor ,  our  MICOM  Consultant  from 
VPI.  After  a  review  of  the  data  in  which  he  agreed  to  its  abnormality, 
he  suggested  n  transformation  to  reduce  the  variability  and  that,  in  his 
opinion,  the  data  followed  a  Poissonian,  and  possibly,  a  logarithmic 
distribution  instead  of  being  normally  distributed.  A  few  san^ples  were 
tested  by  computing  the  variance  and  the  mean  to  determine  if  the  variance 
was  proportional  to  the  mean.  The  variance  was  found  to  be  approximately 
proportional  to  the  mean  which  indicated  the  transformation  to  be  utilized 
could  be  the  sq rare  root  of  the  sample  value.  However,  the  results  of 
this  transformation  after  the  necessary  computations  were  completed 
approximated  the  original  results.  Since  some  of  the  data  were  less  than 
unity,  one  was  added  to  each  sample  value  and  the  square  root  transforma¬ 
tion  was  again  attempted.  Once  again,  although  variability  did  decrease 
e  .nificantly,  the  results  still  approximated  the  original  results;  variations 
(standard  deviations)  wore  still  Loo  Large.  And,  I  still  felt  like  this! 


SLIDE  9.  Tukey  -  Dixon  -  Snedecor.  Research  and  consultation  with 
local  statisticians  produced  "Tukey's  Test  of  Additivity,  "  a  procedure 
which  is  fourfold  in  nature:  It  (l)  helps  decide  if  a  transformation  is 
necessary,  (2)  indicates  a  suitable  transformation,  (3)  indicates  if  the 
transformation  was  successful,  (4)  gets  evidence  about  aberrant  observa¬ 
tions.  Application  of  this  test  on  a  few  selected  cells  by  an  experienced 
statistician  indicated  the  transformation  required  was  logarithmic. 

By  this  time,  the  analysis  of  variance  results  had  been  received  from 
the  Computation  Center.  As  previously  stated,  "F"  tests  at  the  95%  and 
99%  levels,  revealed  there  were  no  significant  differences  between  missiles, 
between  colors,  and  between  routes.  There  were  highly  significant 
differences  between  stations  but  these  were  to  be  expected  since  the  work 
content  in  each  station  is  different  and  does  require  different  intervals  of 
time  to  perform.  These  results  from  the  analysis  of  variance  permitted 
the  combination  of  like  routes  for  each  missile  system,  to  obtain  larger 
samples.  However,  for  comparison  purposes,  each  route  was  calculated 
singly  as  well  as  combined  for  each  missile  system. 

To  return  to  the  second  result  of  the  application  of  Tukey's  Test, 
several  observations  were  found  to  be  aberrant.  Unfortunately  Tukey’s 
Test  merely  indicates  aberrance  but  does  not  correct  them.  Through  the 
use  of  Dixon's  "Ratios  Involving  Extreme  Values,  "  a  technique  which 
permits  one  to  determine  if  a  value  is  aberrant,  the  original  matrix  was 
reentered  and  all  values  in  each  cell  were  tested  for  aberrance.  As  each 
aberrant  value  was  discovered,  the  remaining  values  for  that  particular 
cell  were  tested  for  aberrance  until  all  data  were  purified.  The  removal 
of  these  aberrant  values  left  the  matrix  with  several  missing  values.  A 
review  of  techniques  to  replace  these  missing  data  led  to  Snedecor's 
Iterative  Procedure  which  was  subsequently  utilized. 

With  the  matrix  again  complete,  ail  recorded  data  were  transferred 
into  logarithmic  values.  The  necessary  computations  were  then  performed 
by  hand  for  all  routes  and  stations  for  each  missile  system.  These  compu¬ 
tations  resulted  in  significantly  lower  variations  and,  more  significantly, 
after  converting  the  derived  values  back  to  normal  values,  truly  approxi¬ 
mated  the  real  situation;  a  highly  skewed  curve  to  the  left  without  negative 
times . 
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AVERAGE  TIME  (DAYS)  PER  STATION  FOR  ALL  MISSILES 
{ 2.2  DAYS  -  ALL  STATIONS) 


SLIDE  10.  Matrix  w/avg  station  lengths.  Since  the  mean  times 
were  now  available  for  each  station  for  each  missile,  these  stations 
wore  Syiitl11.ai4.eu  liinj  routes  anti  subsequently  corretated  against  the 
average  times  of  the  original  routes.  This  was  accomplished  for  each 
missile  system  as  well  as  the  combined  group  of  missile  systems.  This 
slide  depicts  the  average  station  length  for  all  missiles  and  the 
synthesized  total  for  each  route.  The  coefficient  of  correlation  as 
calculated  proved  to  be  +0.87  which  is  highly  significant  (well  above  the 
99%  level  of  +0.  708  in  a  significance  table  for  my  number  of  degrees 
of  freedom). 


AMPLES  OF  OCCURRENCES  OP  PDS  ON  VARIOUS  ROUTES 


SLIDE  11.  400  PDS  sample.  Current  proc  edures  provided  a  maxi¬ 
mum  Of'  90  rifles  Knfnm  :i  BnQ  I  i  '>  n  >■)  iJ  Linn/...  «  Ill'll  i  • 

•  -  - - - * '  v‘*.i*'.[uc,in,(  w  Hu  t.  l:  lie  ill  L'XCCjjUons 

for  sonic  PDS's  on  a  lew  routes,  no  interim  time  limits  were  specified, 

This  analysis  indicated  that,  on  the  average  (95%  of  the  time),  all  DOS's 
can  be  processed  through  each  station  in  2.2  days,  with  a  range'  from” 

0.  8  days  to  6.  3  days.  A  recommendation  for  the  establishment  of  time 
limits  for  each  station  wouLd  preclude  lengthy  delays. 

Unfortunately ,  S&.  M  cannot  predetermine  which  route  a  PDS  will  flow 
(with  certain  exceptions)  but  on  the  average,  the  longest  route  length 
(without  broadcast)  was  approximately  30  days.  The  final  step  obviously 
was  the  determination  of  the  frequency  of  occurrence  of  this  longest  route. 

In  accordance  with  a  400  PDS  sample  (100  from  each  missile  system), 
this  route  (30  day  length)  occurred  approximately  1  1%  „f  the  time,  This 
slide  illustrates  the  results  of  this  sample. 


SLIDE  12,  Axe  head  cutting  time  in  half.  Since  a  PDS  did  not 
become  delinquent  until  the  expiration  of  90  days  and,  on  the  average, 
our  longest  route  length  was  approximately  10  rlavn  nln»  ^  fn,? 

broadcast  plus  nearly  8  additional  days  for  two  standard  deviations,  or 
a  total  of  approximately  43  days,  45  days  (for  the  sake  of  a  nice  round 
number)  was  selected  for  the  maximum  time  limit.  To  support  the 
recommendation  of  reducing  the  90-day  time  limit  to  a  45-day  time  limit, 
the  probability  that  this  30-day  route  would  exceed  45  days  had  to  be 
calculated.  This  probability,  using  the  t-distribution,  was  calculated 
to  be  0.  04.  Thus  the  probability  that  this  route  length  would  exceed  45 
days  is  13%  times  the  probability  (.  04)  which  is  five  one -thousandths  or 
only  5  times  out  of  1000. 

Consequently,  a  recommendation  for  a  45-day  limit  was  tendered 
in  my  final  analysis. 

Although  this  now  concludes  my  presentation,  I  would  like  to  provide 
you  with  a  very  short  follow-up.  The  45-day  limitation  was  not  accepted 
because  the  powers  in  control  felt  that  this  cut  in  time  was  too  drastic. 
Instead,  a  sixty-daytime  limit  was  substituted.  However,  through 
improved  flow  procedures  and  the  subsequent  recommended  elimination 
of  one  station,  this  60-day  time  limit  was  recently  cut  down  to  45  days. 
Additional  studies  (non-statistical)  are  currently  being  performed  to  effect 
a  further  reduction  in  time. 


I  thank  you.' 


OPTIMAL  ECONOMY  IN  PLANNING  EXPERIMENTS- 
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Biostatistics  Department 
University  of  North  Carolina  at  Chapel  Hill, 

North  Carolina 

ABST RACT.  Suppose  that  a  cost,  y,  (which  is  a  random  variable)  is 
a  non-linear  function  of  some  controlled  variable  x,  and  in  a  general  case, 
is  expressed  as  a  polynomial  of  k-th  degree  in  x.  Let 


,  t 

y  =  c(x)  +  2  ax 

t=l  t 


K  t 

Y  =  E(y  [  x)  =  2  Ax 


be  the  estimated  and  expected  ("true")  cost  functions  respectively.  Let  x 
and  x  be  the  values  of  x  at  which  the  estimated  and  expected  cost  function 

^  ^ 

attain  minima  respectively.  Further,  let  Yq  =  E(y  |  x^)  be  the  actual 
expected  cost  when  xq  is  substituted  for  (unknown)  xq,  and  Yq  =  E(yJxQ) 
the  'true'  minimum  cost.  We  define  the  'allowance'  cost  as 

(3)  E{?0  -  Yo)  =  £  At  [E(S‘)-X*  ]  . 


If  c(x)  estimates  C(x)  closely,  then  (3)  will  usually  be  small. 

To  evaluate  (3)  we  have  to  find  the  distribution  of  x  which  is  a 
function  of  regression  coefficients  a^,  . a  , 

(4)  xq  =  g(a^ ,  a2 . ak)  . 


In  the  general  case  this  may  be  complicated,  but  for  sufficiently  large 
sample  size,  n,  we  can  find  an  approximate  distribution  using  the  Central 
Limit  Theorem  and  a  Taylor  series  expansion  of  the  multivariable  function 

’’This  paper  has  been  accepted  for  publication  in  "Operations  Research". 
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(4).  Application  of  orthogonal  polynomials  appears  to  be  relevant  to  this 
situation. 

It  ia  easy  to  see  from  (jj  that  t;(Yo  -  Y  )  depcr  ,s  on  the  chape  of  the 

true  cost  function, C{x),  even  if  the  fitted  regression  function,  c(x),  is  of 
the  right  order.  Incorrect  choice  of  the  degree  of  r(x)  might  affect  the 
'allowance'  cost  more  severely. 


ON  A  CLASS  OF  NONPARAMETRIC  TESTS  FOR  MANOVA  IN  TWO  WAY  LAYOUTS* 


PRANAB  KUMAR  SEN 


University  of  North  Carolina,  Chapel  Hill, 
and  University  of  Calcutta. 

SUMMARY.  The  object  of  the  present  investigation  is  to  propose  and  study  a  class 
of  nonparametric  tests  for  the  multivariate  analysis  of  variance  (MANOVA)  problem 
relating  to  complete  two  way  layouts.  In  this  context,  the  concept  of  rank-permuta¬ 
tions  for  multidimensional  interchangeability  is  developed,  and  the  same  is 
incorporated  in  the  formulation  of  a  class  of  genuinely  distribution-free  rank  order 
tests.  Asymptotic  properties  of  the  class  of  proposed  tests  are  studied  and  compared 
with  those  of  the  standard  parametric  ones. 

1.  INTRODUCTION 

Let  us  consider  a  complete  two  way  layout  comprising  of  n  complete  blocks 

(replicates) ,  each  block  containing  r(>  2)  plots  where  r  different  treatments  are 

applied.  The  yield  (response)  is  a  p  variate  quantitative  (stochastic)  vector, 

(k) 

and  we  denote  by  the  k-th  response  for  the  jth  treatment  placed  in  the  ith 

block  for  1*1 . .  j  «  1,  . ..,  r,  k  ■  1,  ...,  p.  In  the  sequel,  it  will  be 

assumed  that  n,  r,  p  _>  2.  Let  then 


?ij  "  (xij)f  •••’  1  “  l»  *•*'  n*  j  "  1 . .  (1,1> 


i*  -  <ua> . u<p))i 


(1.2) 


*Work  supported  by  the  Army  Research  Office,  Durham,  Grant  DA-31-124-ARO-D-G4  32, 
This  article  was  reproduced  photographically. 


(1.3) 

(1.4) 

(1.5) 


hi  M  **  h +  Ij  +  hy  1  "  1*  •••■  n>  j  "  1 . .  <I,6) 

where  w  1*  the  vector  of  mean  effects,  the  block  effects  (i  ■  1,  ....  n), 
the  treatment  effects  (j  ■  1,  ....  r),  and  the  residual  error  vectors 

(1  ■  1 . .  j  ■  1,  r) .  These  component  vectors  are  assumed  to  be  mutually 

independent.  Our  problem  is  to  have  a  comprehensive  test  for  the  hypothesis  of 
no  treatment  effects  i.e., 


In  the  parametric  case,  it  is  usually  assumed  that  (i  •  1(  n,  j  ■  1 .  r) 

are  N(-  nr)  independent  and  identically  distributed  stochastic  vectors  distributed 


according  to  a  multinomial  distribution  with  a  null  mean  vector  and  a  dispersion 

(k) 

matrix  (positive  definite)  I  -  ((°k^)),  where  is  the  covariance  of  (e^  , 


for  k,  q  ■  1, 


....  p.  The  parametric  MANOVA  tests  are  either  based  on  the 


likelihood  ratio  criterion  or  on  the  characteristic  roots  of  some  determinants! 


equations.  The  likelihood  ratio  criterion  reduces  to  the  ratio  of  two  generalized 
variances  and  can  be  expressed  as  the  product  of  several  (p)  independent  beta 
variables  (cf.  Anderson  (1958,  Chapter  8)).  Alternatively,  one  may  work  with  the 


smallest  characteristic  root  of  the  determinental  equation  involving  the  same 


generalized  variances.  Occasionally,  some  symmetric  function  of  the  roots  are  also 
used.  For  details,  the  reader  is  referred  to  Rao  (1965,  chapter  8).  The  parametric 
tests  thus  appear  to  be  deterministic,  but  they  are  not  very  simple,  especially 
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for  p  >  2.  Further,  in  this  procedure  the  assumptions  of  independence  and.  multi- 
normality  of  the  error  vectors  play  an  indispensible  role.  Unlike  the  univariate 
case,  very  little  has  been  investigated  about  the  effects  of  departure  from  these 
two  basic  assumptions  on  the  performance  characteristics  of  the  parametric  MANOVA 
tests.  On  the  otherhand,  the  assumption  of  multinormality  of  the  error  vectors 
often  found  to  be  dubious,  especially  in  many  biometric  problems.  Further,  in 
many  problems,  there  appears  to  be  sufficient  evidence  on  the  stochastic  dependence 
of  the  error  vectors  within  the  same  block.  For  example,  in  agricultural  experiments, 
the  presence  of  spatial  correlation  may  distort  the  stochastic  independnece  of  the 
error  vectors  within  the  same  block.  Similar  dependence  may  be  due  to  genetic 
effects  in  many  animal  feeding  experiments.  The  object  of  the  present  investigation 
Is  to  relax  both  the  assumptions  of  multinormality  as  well  as  independence  of  the 
error  components.  In  fact,  for  the  tests  proposed  here,  we  require  only  that 
(i)  the  joint  distribution  function  FCs^,  ....  elf)  of  e^,  ....  elf  is 
continuous  and  Independent  of  i  -  1,  ...,  n,  and 

(ii)  F(e^,  eJr)  is  a  symmetric  function  of  its  r  arguments  (vectors) 

. «^r  i.C. ,  F  remains  invariant  under  any  permutation  of  the  r  vectors 

among  themselves,  or  in  other  words,  e^,  ...,  are  symmetric  dependent  stochastic 
vectors . 

Evidently,  both  the  assumptions  (i)  and  (ii)  are  much  less  restrictive  than 
the  usual  assumptions  of  independence  and  multinormality.  Thus,  the  proposed  method 
appears  to  have  a  comparatively  wider  scope  of  applicability. 

In  the  nonparametric  case,  practically  no  work  has  been  done  on  this  line.  For 
completely  randomized  layouts,  very  recently  some  nonparametric  MANOVA  tests  have  been 
offered  by  Chatterjee  and  Sen  (1964,  1966),  Sen  (1965,  1966a),  Puri  and  Sen  (1966), 
end  Anderson  (1965),  among  few  others.  Bhapkar  (1965)  has  also  presented  some 
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asymptotically  distribution-free  test  for  the  same  problem.  The  present  author 
(1966  b)  has  considered  some  rank  methods  for  combination  of  independent  experiments 
in  MANOVA.  The  same  procedure  is  applicable  in  our  situation  here,  but  it  fails  to 
be  suitable  in  some  respects.  This  problem  may  also  be  regarded  as  the  multivariate 
generalization  of  the  nonparametrlc  ANOVA  tests  relating  to  two  way  layouts.  Such 
ANOVA  tests  have  been  considered  by  Friedman  (1937),  Durbin  (1951),  Brown  and  Mood 
(1951),  Benard  and  Elteren  (1953),  and  others.  These  are  all  based  on  intra-block 
rankings,  and  the  same  method  can  be  generalized  to  the  MANOVA  problem.  The 
present  author  (1966  c)  has  considered  a  modified  approach  to  nonparametrlc  ANOVA 
tests  for  two  way  layouts.  Extending  an  idea  of  Hodges  and  Lehmann  (1962),  he  has 
considered  the  rankings  after  alignment,  and  under  a  suitable  permutation  model, 
has  obtained  a  class  of  genuinely  distribution-free  tests  based  on  these  modified 
rankings.  This  results,  in  most  of  the  cases,  in  an  increased  (at  least  asymptotically) 
efficiency  of  the  proposed  test.  The  object  of  this  paper  is  to  generalize  the  method 
of  rankings  after  alignment  to  the  MANOVA  problem  and  to  offer  some  suitable  non- 
parametric  tests  for  the  same.  For  this  purpose,  the  concept  of  multidimensional 
interchangeability  is  developed  and  certain  rank  permutational  ideas  are  formulated. 

With  the  aid  of  this  a  class  of  properly  distribution-free  rank  order  tests  for 
the  hypothesis  in  (1./)  is  developed.  Further,  the  celebrated  Chernof f-Savage 
(1958)  theorem  on  the  asymptotic  normality  and  power-efficiency  of  a  class  of 
univariate  nonparametrlc  test-statistics^  as  extended  to  the  multivariate  case  by 
Puri  and  Sen  (1966)  and  to  the  problem  of  compound  symmetry  of  multivariate 
distributions  by  Sen  (1966  c),  is  extended  further  to  take  care  of  the  problem  of 
multidimensional  interchangeability,  to  be  considered  here.  With  the  aid  of  this, 
the  asymptotic  power  and  power-efficiency  of  the  proposed  class  of  tests  are  studied. 
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2.  SOME  PRELIMINARY  NOTIONS. 


Let  us  define  a  set  of  r 1  real  quantities  by 


cij  “  *ij  ’  1/r  for  1  "  •"*  r> 


where  S. .  is  the  usual  Kronecicer  delta.  Thus.  E?  ,  c..  »  0  for  all  l 
*>3  J  “i  tj 

Let  us  then  consider  the  r  intra-block  contrasts 


1 ,  . . . ,  r. 


lit  "  Ej-1  C1J  ?ij»  1  “  1 . r* 


From  (1.6)  and  (2.2),  we  have 


r  £j-l  Ij *  +  (!ii  “  r  £j-l  ^ij5. 


(2.3) 


where  the  first  factor  on  the  right  hand  side  of  (2.3)  vanishes  when  Hq  in  (1.7) 
holds.  Further,  by  assumption  (ii)  of  section  1,  we  get  with  some  simple 
reasonings  that  the  joint  distribution  of  [(e^a  -  ~  eij^*  *■  ■  •••  r]  i» 

a  symmetric  function  of  the  r  (vector)  arguments.  Consequently,  from  (2.3),  we 
get  that  under  Ho  in  (1.7),  the  joint  distribution  of  (Y^  ,  ...,  Yif)  will  be  a 
symmetric  function  of  the  r  vectors  Y^,  ...,  Ylr«  On  the  otherhand,  if  Hq  in 
(1.7)  does  not  hold,  the  joint  distribution  of  (Y^,  ....  Y^^)  will  be  a  symmetric 
function  of  its  (vector)  arguments  only  when  each  one  of  them  is  adjusted  by 
appropriate  location  vectors.  Thus,  if  instead  of  the  observed  responses  X^'s, 
we  work  with  the  block-adjusted  yields  Y^'s,  our  problem  of  testing  Ho  in  (1.7) 
reduces  to  that  of  testing  the  hypothesis  of  Interchangeability  of  the  vectors 

Yil*  •“*  -ir  <for  *11  1  “  i . n),  »g»i««t  translation  type  of  alternatives. 

This  is  termed  the  problem  of  multidimensional  interchangeability,  and  a  formulation 
of  an  appropriate  rank  permutation  model  for  the  same,  will  be  considered  in  the 


125 


next  •action.  The  necessary  rank  order  statistics  will  be  defined  now. 

Let  us  pool  the  N(-  nr)  observations  {Y^j  *  j  -  1,  r,  i  •  l,  n} 

Into  a  combined  set  and  denote  the  ordered  observations  bv 


00 
(N)  * 


(2.4) 


where  by  virtue  of  the  assumed  continuity  of  the  distribution  of  the  error 
vectors,  the  possibility  of  ties  In  (2.4)  may  be  neglected,  in  probability.  Let 
than  C(u)  be  the  usual  sign-function  viz., 


c(u)  - 


>  1,  if  u  >  0 
0,  if  u  v_  0, 


and  let 


N 

1  +  I  c(Y 
o-l 


00  v00 

ij  "  Y(c0 


), 


(2.5) 


(2.6) 


for  i  -  1 . .  j  -  1,  . . . ,  r. 

Thus  stands  for  the  rank  of  Y^  within  the  set  (2.4).  This  ranking  procedure 
is  employed  separately  for  each  k  ■  1,  ....  p.  Consequently,  any  vector  Y^  having 
p  elements  is  made  to  correspond  to  a  rank  p-vector 

?ij  "  (Rif*  •••*  Rij})*  (2'7) 

for  i  •  1,  ...,  n,  j  -  1,  ...,  r.  The  composite  collection  is  a  p  x  N  matrix 


S*”  ■  (?11 . hr’  •••’  ?nl'  •••*  ?nr)  1 

Rjj  will  be  termed  a  collection  (rank)  matrix.  Each  row  of  is 
the  (lumbers  1,  ...,  N.  For  any  positive  integer  K(-  nr,  n  *  1, 


a 

2, 


(2.8) 

permutation  of 
. . .)  we  define 
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p  sequences  of  resl  numbers  by 


,00  .  ,„<k) 

:n  '“n,i  * 


’  “N,N"  •••»  P* 


's  are  all  real  quantities  and  ara  explicit  functiona  of  CgJj)  •  We  adopt 
the  coventional  Chernoff -Savage  (1958)  fora  and  write 

*£«  -  ®  *  1 . .  k  «  1,  ....  p,  (2.10) 

(k)  a 

where  the  function  '  need  be  defined  only  at  ,  a  ■  1,  . ..,  N.  However, 
we  shell  find  it  more  convenient  to  extend  its  domain  of  definition  to  (0,  1) 
according  to  the  Chernoff-Savage  convention.  Also,  we  define  rp  requences  of 
indicator  functions  {7.V”  ,  o  *  1 . N}  ,  for  j  »  1,  ...,  r,  k  •  1,  p  by 


1,  of  y|£5  Is  some  Y^}(i  -  1 . n) , 


0,  otherwise. 


for  o*l . N.  Then  we  define  rp  rnak  order  statistics 


It  may  be  noted  that 


(2.11) 


TS*J  "  »  Cl  *M  1-1 . t,  It  -  1 . . 


i  Ci  T"  -  s  Ci  4“  ■  <“>’■>•  k  '  1 . .  ”-13> 


. E^P')  are  all  known  constants  (depending  on  N).  Thus,  at  most 

(r  -  l)p  of  the  rp  variables  in  (2.12)  are  linearly  Independent.  Our  proposed 
test  is  baaed  on  the  set  of  random  variables  in  (2.12).  To  develop  strictly 
distribution-free  testa  for  the  hypothesis  (1.7),  ve  shall  consider  in  the  next 


where  E*1* 
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•tction  some  permutation  model.  But,  before  that  It  may  be  worth  writing  . 
a  point  of  clarification.  The  class  of  statistics  in  (2.12)  has  some  similarity, 
with  that  of  a  similar  class  of  statistics  considered  by  Purl  and  Sen  (1966). 
However,  in  the  later  case,  we  have  a  one  way  classification  with  N  independent 
p- variate  observations,  while  In  this  case,  we  have  a  two  way  clasaification 
with  n  independent  pr-variate  observations.  This  makes  the  situation  somewhat 
more  complicated,  and  requires  a  more  specialized  attention  for  both  the  permutation 
as  well  as  asymptotic  test  theory. 


3.  RANK  PERMUTATIONS  FOR  MULTIDIMENSIONAL  INTERCHANGEABILITY . 


The  collection  matrix  Rjj|X**,  given  by  (2.8),  is  now  expressed  in  terms  of  n 
submatrices  R^xr,  ...,  R^xr,  where  R**xr  is  the  matrix  of  the  r  rank  p- tup lets 
corresponding  to  (Y±i,  ....  Yir>,  for  i  ■  1,  ...,  n.  Thus,  we  have 


jwN  ,.pxr  -pxr. 

?N  '5.  . }‘ 

How  under  the  null  hypothesis  (1.7),  the  Joint  distribution  function  GfY^, . . . **ir) 

is  a  symmetric  function  of  Y^,  ...,  Yir,  and  hence,  the  same  remains  invariant 

under  any  permutation  of  the  r  vectors  in  the  r  positions  of  G.  Since,  there 

are  r!  possible  permutations  of  the  r  vectors  among  themselves,  the  permutational 

probability  (l.e. ,  conditional  probability)  mass  associated  with  each  of  the  r! 

possible  permutations  is  equal  to  (r!)”  , (under  Hq  in  (1.7),)  for  all  i  ■  1,  ...,  n. 

Since,  (Yjj»  ....  Y^r)  is  distributed  (jointly)  independently  of  (Y^,  ...,  Y^,r) 

for  ell  1  }  1*  ■  1,  ...,  n,  the  joint  distribution  of 


(3.1) 


TH  "  (Y11’  Yir*  Ynl . Ynr} 


(3.2) 


remains  Invariant  under  the  following  finite  group  of  transformations  '  {  gR  > 


which  nape  the  sample  s  ;ace  of  YM(gnto  itself.  The  number  of  elements  of  ^  is 
equal  to  (r!)n,  and  typically  a  transformation  is  such  that 

*n  In  -  -  <!tl*  ••••  lit . InV  ••  Inr>»  <3*3> 

where  (Y*^,  •••»  Y*^)  la  any  permutation  of  (Y^,  *ir) ,  1*1,  ....  n. 

Let  ^  be  the  Kp-dimensional  sample  space  of  (and  ve  take  it  to  bo  the  Np-diaenslonal 
Euclidean  space).  Evidently,  the  sample  space  of  Y*  in  the  same  as  that  of  Y^,  and 
moreover,  under  Hg  in  (1.7),  the  joint  distribution  of  remains  invariant  under 


the  group  of  transformations 


Let  now  S(Y^)  be  a  (real  or  vector  valued) 


function  on  Y^.  Then,  for  any  Y^  e  we  will  have  a  set  of  (r!)n  values  of  S(YN>, 
obtained  under  the  group  of  transformations  and  this  set  is  denoted  by  l(Y^) . 

Then,  under  the  null  hypothesis  (1.7),  the  conditional  distribution  of  S(Yjj)  over  the 
set  E(Y„)  will  be  uniform.  Let  us  define  ss  in  (2.2),  and  let 

T*xp  -  .  i,  ....  r,  k  -  1 . .  (3.4) 

Then,  it  follows  that  Tu  is  a  stochastic  matrix,  which  under  the  group  of  transformations 
can  have  only  (r!)n  possible  realizations.  Since  is  an  explicit  function  of  the 

N  rank  p-tuplets  R^ ,  1  *  1 . .  j  ■  1,  ...,  r,  it  will  be  more  convenient  for 

us  to  review  the  above  Invariance  argument  in  terms  of  the  following  rank-lnvariance 
argument. 

The  way  in  which  we  have  defined  In  (2.8)  and  (3.1),  it  follows  that  for  any 
Yg  c  there  will  be  a  corresponding  collection  matrix  R^.  On  examining  the  group 
of  transformations  it  will  be  clear  that  the  transformation  gn  on  Yg,  given 
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by  (3.3),  gives  rise  to  another  collection  matrix  P*,  which  is  obtained  by  applying 
the  same  transformation  gQ  on  the  original  collection  matrix  y  Thus,  under  the 
erouo  of  transformations  of  {*  }  ,  the  rank  collection  matrix  R,,  (corresponding 

*  *  %#U  U  ^ 

to  c  y  gives  rise  to  a  set  of  (r!)fl  rank  collection  matrices  (obtained  by 
applying  the  same  transformations  { 8n > » )  and  this  set  is  denoted  by  f(y .  If  R* 
is  any  member  of  I(y ,  we  note  that  Rg  is  really  derived  from  R^  by  a  finite  number 
of  inversions  of  the  columns  of  the  later.  Thus  we  may  write 

Rg  *  Ry  (mod  £^)  for  all  Rg  e  l(y .  (3.5) 

Hence,  the  set  E(y  contains  (r!)n  rank-matrices  which  are  permutationally  (under 
inversions  of  intra-block  columns)  equivalent  (under  to  y  Thus,  we  term 

£(Rg)  as  the  permutation  set  (mod  2^.)  of  y  like  is  a  stochastic  variable, 
and  each  row  of  R^  is  a  permutation  of  1,  ...,  N.  Thus,  R^  can  have  (N!)**  possible 
realisations ,  and  this  set  of  all  possible  realizations  of  R^  is  denoted  by 
so  that 

Rj,  e  icy  C0^N.  (3.6) 

The  probability  distribution  of  R^  on  0^  (defined  on  an  additive  class  of  subsets 
of  O^i,,)  will  depend  on  the  unknown  joint  distributions  G  (Y^,  ...,  Y^), 

1-1,  ...,  n,  even  under  Hq  in  (1.7).  Thus,  unlike  the  case  of  univariate  one  way 
classified  data,  the  use  of  the  unconditional  distribution  of  R^  will  fail  to 
provide  a  distribution-free  test.  However,  from  what  has  been  discussed  before,  it 
follows  that 


MRj  •  Rg  |E<y.  Ho)  -  (rJ)  ",  (3.7) 

i 

for  ail  Rg  t  E(y  ,  Independently  of  G(Y^,  Yir) ,  1  -  1,  ....  n.  Now,  the  way 


I 


/u\ 

in  which  ,  k  ■  1,  ....  p,  are  defined  by  (2.9),  (2.10),  it  follows  that  in  ' 
[(2.12),  (3.4)]  is  an  explicit  function  of  Rg.  Thus,  the  set  C(Rg)  will  give  rise 
to  a  set  of  (n!)n  realizations  of  Tg,  and  this  set  is  denoted  by  I(Tg) .  Hence, 
under  the  perautational  probability  measure  (3.7),  we  will  have  a  completely  specified 
perautational  distribution  of  Tg,  and  the  corresponding  perautational  probability 
measure  is  denoted  by  6^.  Let  us  then  consider  a  test  function  $(Yg)(0  <  *  _<  1) , 
which  to  each  Yg  e  Yg  associates  a  probability  of  rejecting  HQ  in  (1.7),  with  the 
aid  of  <Pa.  It  follows  that  we  can  always  select  $(Yg)  in  such  a  manner  that 


[  4  <Y*) 

T8‘£<V 


(r!)n-  t, 


i 

(3.8) 


where  c(0  <  e  «  1)  is  the  preassigned  level  of  significance  of  the  test.  Consequently, 

♦  (Yg)  has  the  S(e)  -  structure  of  tests  £cf.  Lehmann  and  Stein  (1949)]  ,  and  is  a 
similar  alze  c  teat  for  the  null  hypothesis  (1.7). 

Now,  in  actual  practice,  we  prefer  to  use  some  single-valued  function  of  Tg  as 
a  test-statistic.  There  seems  to  be  no  definite  suggestions  regarding  the  structure 
of  this  test-statistics,  and  an  optimum  choice  naturally  may  depend  appreciably  on 

j 

the  particular  class  of  alternatives  we  have  in  mind.  However,  it  may  be  suitable  { 

(though  not  necessarily  optimum)  to  consider  the  following  test-statistic  which  is  j 

the  quadratic-form  associated  with  the  asymptotic  permutation  distribution  of  Tg.  ■ 

I 

Tor  this,  let  us  consider  first  the  perautational  moments  of  Tg.  It  readily  follows  j 

that  | 

* 

I  |5»n  )  “  *Hk).  for  k  -  1 . .  j  -  1 . r.  (3.9) 

i 

Let  us  define 
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C  oo  ■  7  *J.l  <k>  •  1  ■  * . ”•  k  -  1 . .  <3-10> 

i-  •  ij 


as  the  intra-block  averages.  Also  let 


r  (p  )  .  — j -  y  y  E0t)  _g(k)  =(q)  _g(q) 

kq  ?N  n(r-l)  ±ml  ^  ■  N>R^<k)  NfRi  00  <q)  N>R^  (q) 


,  <3.11 


for  k,  q  1,  p; 


yy  -  «vv”*. ,  - 1 . 


(3.12) 


It  is  then  easy  to  verify  that 


-fe<Vr'uVy- 


(3.13) 


for  k,  q  •  1,  ...,  p,  j,  j'  -  1,  ...»  r,  where  4^ ,  is  the  usual  Kronecker  delta. 

For  the  time  being,  let  us  assume  that  Vr(Rjj),  given  by  (3.12),  is  positive  definite, 
and  denote  its  reciprocal  matrix  by 


S‘<y  •  «’'k,<y»u, , .  i . . 

Our  proposed  test-statistic  can  then  be  expressed  as 

h  ■ "  Jj  Jj  *k,<y  j,  -  ^k,J  -  «?>] 


(3.1A) 


(3.15) 


and  it  may  be  noted  that  is  essentially  a  non-negative  stochastic  variable.  We 
shall  see  later  on  that  under  certain  regularity  conditions  on  GCY^,  ...»  Yir)» 

Vjj(Rjj)  is  positive  definite  with  a  very  high  probability,  (precisely,  in  probability). 
However,  if  V^Ol^)  fails  to  be  non-singular,  we  may  work  with  the  highest  order 
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principal  minor  of  Vjj(Rjj)  which  la  positive  definite,  end  proceed  similarly  only 
with  the  responses  pertaining  to  this  minor.  Thus,  for  convenience,  we  may  assume 
Vg(lg)  to  be  positive  definite.  Now, 

1<Sl!((Pn)  •  p(r  -  1),  (3.16) 

and  Sjj  measures  the  distance  of  ,  in  (3.4),  from  the  permutatlonal  centre  of 
gravity  of  the  same.  If  Bo  in  (1.7)  does  not  hold,  it  can  be  shown  that  for  at 

least  one  k  »  1 . .  and  one  j  ■  1,  ...,  r,  will  converge  to  e  point 

(stochastically)  other  than  and  hence,  by  (3.15),  SN  will  be  stochastically 

larger.  Thus,  we  may  propose  the  following  teat  function: 


♦«.>  • 


1-uh>  «,,.<!,>• 

«  v  ■  v«<y. 

"•u»,  * 


(3.17) 


where  the  constants  t(F^)  and  Y(K{|)  may  usually  depend  on  and  are  so  chosen 
that 

*<♦(][„>  l£Pn>  -  t:  0  <  c  <1.  (3.18) 


(3.18)  implies  that  E{4(YI})|Ho)  ■  c.  ?or  small  values  of  n(and  r),  one  may  venture 
to  evaluate  the  exact  values  of  Sg  t(R^)  and  y(R^)  with  the  aid  of  (3.7).  However, 
the  labor  of  this  process  of  evaluation  Increases  considerably  with  the  Increase 
in  n(or  r),  and  hence,  as  in  other  permutation  tests,  we  are  faeed  with  the  problem 
of  finding  out  the  asymptotic  form  of  the  permutation  distribution  of  Sg.  This  is 
dona  in  the  next  section. 
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4.  ASVMPTOTIC  PERMUTATION  DISTRIBUTION  OF  S„. 

M 

■  We  shall  Impose  certain  regularity  conditions  on  the  p  sequences  {E^  }, 

k  »  i,  . p,  defined  by  (2.9)  and  (2.1U),  as  well  as  on  the  joint  distribution 
function  C(Til#  . ..,  Yir).  Let  us  define 

**[j]  ^  "  n  [Nunber  of  i.  *]  »  k  ■  1,  ....  p,  j  -  1,  r;  (4.1) 
B^k)  (*)  k  -  1,  ••••  P5  (4.2) 

FN[j?t]<X*  y)  "  n  CNumber  of  <^k),  Y^})  <  (x,  yjTJ,  (4.3) 

for  k,  q  -  1,  . ..,  p,  j,  1-1,  ...,  r  with  either  j  +  l  or  k  f  q  or  both. 

(k) 

Now,  corresponding  to  the  joint  cdf  G,  let  us  denote  the  marginal  cdf  of  Y^  and 

of  (Y^ ,  Y^jp)  by  and  y)»  respectively,  for  j ,  1-1,  ...»  t,  k, 

q  ■  1 . .  with  at  least  one  of  j  j*  1,  k  i  q  being  true,  and  let 

H°°(x)  -  7  iJ.j  p[Jj(x),  for  k  -  1 . .  (4.4) 

(k) 

With  the  definition  of  ^  'fa  as  in  (2.10) f  we  make  the  following  assumptions 

(Is) 

concerning 

ASSUMPTION  1.  J^k)(H)  -  J(k)(K)  exists  for  all  0  <  H  <  1  and  is  not  a  constant. 

Since,  we  shall  be  Interested  here  in  translation  type  of  alternatives,  we  shall 
further  assume  that 

J*k)(H)  Is  t  in  H:  0  <  H  <  1  for  all  k  -  1 . .  (4.5) 
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■  (zj^, . . .,  zj' p^),  j  ■  1,  ...,r; 


(4. 12) 


\q.tf  -  *{*ti>-z{j))  for  <1  ■  )>  *  "  h-,ri  (4.13) 


Aji  "  k,q-  l,...,p»?'J»  1  "  1",'rJ 


(4. 14) 


vkq"  r*>l  ViJ  ‘  ?  VK'  for  k#  q"1,,,,'p  <4*15> 


-  "  ^kq^k,  q-l,...,p* 


ASSUMPTION  5.  V  la  positive  definite 


(4. 16) 


(4.17) 


Before  we  present  the  main  theorems  of  this  section,  let  us  consider  the 
conditions  under  which  assumption  5  holds.  Using  (4.14),  let  us  define 


^(J»i>  "  ^JJ  +  -12  ‘  2  9 


THEOREM  4. 1  Assumption  5  holds  if 


(4. 18) 


[Rank  o£  A(J,f)3  "  P 

PROOF,  Let  •  *if  p)  he  any  reel  end  nor.-null  p-vector,  and  let 

yi'-V  '•  ■  r  =>1  v 

where  Z  *s  are  defined  by  (4.12).  It  is  then  easily  seen  that 
**lj 

i*i  -  «*5>  -«<«?>  a  «• 


(4.19) 


(4.20) 


(4.21) 


Thus,  we  require  only  to  show  that  for  any  non-null  t,  (4.21)  is  strictly 
positive.  Using  essentially  the  proof  of  lemma  4. 1  of  Sen  (1966),  it  can 


be  shown  that  ^E,*.  E(t?)  -  E(t2)  will  be  strictly  positive  unless 

X  }m  i  J  e 

l(t^t|)  •*  E(tp  ■  constant,  for  all  J,2“l, . .., r. 


(4.22) 
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Mow,  using  (4.18)  and  (4.19),  we  get  that 

f(j,i)l  >  °>  (4,23) 

for  at  least  one  pair  (j,i),  Jj«i»l,...,r.  As  RU^-tj)2  £  2[S(t®)+E(t|)], 

(4.23)  Implies  that  E(t2)  >  0  for  at  least  one  J-l, ...,r.  Again,  for  the 
specific  (J,A)  for  which  (4.23)  holds,  we  may  assume  without  any  loss  of  generality 
that  E(t2)  £  E(t2),  E(t2)  >  0,  and  thus,  we  require  only  to  show  that  *(tjt|)  <  E(t^). 
If  B(t2)  -  0,  the  proof  Is  evident,  while,  if  B(tJ)  >  0,  we  have  from  (4.23) 

MCtjt,)  <  E(t“)  +  E(t*)  <  2E(tj).  Hence,  (4.22)  car.  not  hold  for  all  J,l-l,...r, 
if  (4.19)  holds.  Consequently,  (4.21)  is  strictly  positive. 

Hence,  the  theorem. 

It  may  be  noted  that  (4.19)  really  implies  that  the  vector  (gij  -  Z^)  Is  of 
full  rank  for  at  least  one  Jj*A“l, . . .,  r. 

THEOREM  4.2.  Pndar  the  assumptions  1  to  3,  ^(jfo),  defined  by  (3.12).  converges 

in  probability  to  V,  defined  by  (4.16),  end  hence,  is  positive  definite,  in  prob- 

ablliey. 

. .  ■  ■  » 

PROOF.  The  proof  of  this  theorem  follows  as  a  more  or  lass  straightforward  gen* 
eralisation  of  theorem  4.2  of  Puri  and  Sen  (1966)  and  of  theorem  4.2  of  Sen  (1966c). 
Hence,  for  the  Intended  brevity  of  the  paper,  it  it  not  considered  in  detail. 

THEOREM  4.3.  Under  the  assumptions  1  to  5,  the  permutation  distribution  of  the 
atatlstic  3M,  defined  by  (3.15),  converges  asymptotically,  in  probability,  to  a 
chi  square  distribution  with  p(r-l)  degrees  of  freedom  (d.f.). 

PROOF.  He  shall  first  prove  that  under  the  permutation  model  considered  in 
Section  3,  [n£(l^  *  E^),  J“l, . . .,  r-1,  k*l,  ...,p)  has  asymptotically  a  p(r-l) 
multinormal  distribution.  This  would  be  done  by  proving  that  any  arbitrary  linear 
function  of  these  p(r-l)  statistics  has  asymptotically  a  normal  distribution  undar 
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the  permutation  model  of  section  3.  Such  a  linear  compound  can  be  equivalently 
written  as  (by  virtue  of  (2.13),) 


W  -  n^  L  .r 


rn"  n?I>lSk£l  djk  dJk"  °s  *■!.  •••,?.  (4.24) 


Under  aaaumpticm  2,  (4*24)  can  be  rewritten  aa 
n  »  n  «(b) 

B*  £  (£  £  djfc  J<k>  <Sff^  +  0o<1>‘ 

i-1  j-1  fc»l  JK  p 


(4.25) 


Let  ue  then  write 


V(V  ■  ji  Jj  djk  j00<5H->  *  (4.26) 

The  random  variable  Ujj  jCJjj)  can  ^ave  only  r>  possible  equally  likely  values 
trader  our  permutation  model.  These  values  are  obtained  by  permuting  the  r 
vectors  >1, .  .  «,r  (defined  by  (2.7),)  among  themselves.  Thus, 

P  r  R<k>  r 

,W»,l<«lW  '  i,  !r  =  ,<k><5M-»  ^  dJk-  0,  (4.27) 


00 


for  i"l, ...,n.  Similarly, 


J-1 


,(k) 


.<0 


p  p  i»  r  R'"'  r>  _ 

-  £  *  *  £  “Jk  4|,  '»<k)<5y-)J<,)(5M-)|  «-„) 


j,  Vj,hA  Ji<  £  •><k)<^-)) 


P  P  r 
£ 

k-i  «ri 


(^  J<q)(gff-))])  for  i«l, . .  ,,n. 


(q> 


(4.28)  , 
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Sine*  the  permutation*  of  the  rank-vector*  within  the  ith  block  1*  indepen¬ 
dent  of  the  permutations  within  the  i*th  block  for  ij*i»*l, . . .,n,  under  our 
permutation  model,  {U^  ,i«V*  i“l, ...,n)  are  mutually  independent.  Bence,  to 
prove  the  desired  result,  we  may  use  the  Berry-Sssen  theorem  [cf.  Loave  (1962, 
p.  288)],  according  to  which  it  is  sufficient  to  show  that 


u.  ri!i  «IVi<h>l*H».>  .  , 

■”  t£l-l,(UH,t<Vl5’.,l3/2 

From  (3.11),  (3.12)  and  (4.28),  we  get  that 


(4.29) 


n 

i-1 


r  p  p 
£  £  £ 
jri  k-i  <ri 


dJkdJq^q^ 


r  P 

£  (  £ 

J*1  kPl 


P 

£ 


dJkdJqvkq 


P 

{£ 


P 

£ 


J*1  k“l  ^1 


d ..  d.  V.  ), 
Jk  Jq  kq  * 


(4.30) 


whereby  theorem  4.1  and  assumption  5,  the  right  hand  side  of  (4.30)  is  a  (non- 
sero)  positive  constant,  for  any  given  (d^,  J"l,  ...,r,  te*l,  ...,p).  Thus,  It 
la  sufficient  to  show  that  the  numerator  of  the  left  hand  aide  of  (4.29)  is 

P 

and  this  readily  follows  from  assumption  3  and  (4.26).  Hence,  under  our  per¬ 
mutation  model,  the  first  term  of  (4.23)  has  asymptotically,  in  probability,  a 
normal  distribution.  Once  this  it  established,  we  consider  the  quadratic  form 
associated  with  the  asymptotic  multinormal  distribution  of  (n^(I^  -  £^), 
J»l, . . .,  r-l,  k“  1,  ...,p),  and  using  some  well-known  results  on  the  limiting 
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distribution  of  continuous  functions  of  random  variables  [cf.  Sverdrup  (1952)], 

It  Is  easily  seen  that  under  our  permutation  model,  the  statistic  V  given  by 
(3.15),  has  asymptotically,  in  probability,  a  chi  square  distribution  with 
p(r-l)  d.f. 

Hence,  tha  theorem. 

Xt  may  be  noted  that  the  permutation  distribution  of  being  essentially 
a  conditional  distribution,  the.  convergence  in  theorem  4.3  holds,  in  probability, 
i.e.,  for  almost  all  If  we  now  denote  by  x|  £  the  upper  I00e%  point  of  the 
chi  square  distribution  with  t  d.f.,  then  from  (3.17)  and  theorem  4.3,  we  arrive 
at  the  following. 

THEOREM  4.4.  Sjj  c(j6||)  and  y(gjj),  defined  by  (3.17),  converge,  in  probability  to 
Xp(r-1),  £  and  0,  respectively. 

By  virtue  of  theorem  4.4,  the  exact  permutation  test,  considered  in  (3.17), 
reduces  asymptotically  to 


*<v 


lp  if  SN  £  X%(r-l),e 


0,  otherwise; 


(4.31) 


and  (4.31)  will  be  termed  henceforth  the  asymptotic  permutation  teat. 

\ 

5.  ASYMPTOTIC  POWER  OF  THE  PROPOSED  TESTS. 

In  this  section  we  shall  study  the  asymptotic  power  and  power-efficiency  of 
eur  proposed  class  of  testa.  This  requires  first  of  all  the  study  of  the  asymptotic 
(unconditional)  distribution  of  8^,  when  the  null  hypothesis  (1.7)  is  not  nec¬ 
essarily  true.  For  this  study,  we  also  adopt  the  same  notations  as  in  section  4, 
and  write 

•  f  -4k)  «•» 
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for  J»l, ...,r,  to“l,  ...,p.  The  statistics  In  (5.1)  has  some  analogy  with  a  • 
clans  similar  statistics  considered  by  run  and  den  (I9t>bj.  However,  In 
thla  caaa  of  two  way  layout  we  are  faced  with  n  independent  pr-variate  observa¬ 
tions,  while  in  the  earlier  caae.  Purl  and  San  ware  faced  with  the  oneway  layout 
involving  H(«nr)  p-variate  observations.  This  makes  the  situation  somewhat 
■ore  complicated  in  our  case,  and  the  necessary  modifications  will  be  studied 

y  ■ 

hero.  Lot  ua  define 

u00  -  .  /" J(k)(H(k)(x»  dP^jc*),  (5.2) 

for  J-l,  ...,r,  te*l,  ...,p.  Also  let 

pjj'jjg,  -  }  /"[F^J^Cmy)  -  P[<kj(x)P[(]J]<y)]  J'°°<H(k><x)>  J'(q)(H(<I>(y)). 

dp^(x)  dF^jJ^y),  (5.3) 

for  l»  -  1,  ...,r,  k,  <rl,...,P»  with  either  )J»J*  or  tyq  or  both,  while 

P[j]<x)Cl‘F[j](y)]  J,<k><H<k)(*»  J»<k>(H<k)(y))  dpfJjtxJdF^!] 

+  ;  t  F[^](x)Cl-F[5]<y)]  J'<k>(H<k)(x))  J'(k)(H(k)(y))  dF[<k)](x)dF[(}}(y), 


for  J*l, . . .,  r,  kfl,...,p,  1,1'  ■  !,...»  r. 


(5.4) 


Finally,  let 


«(k, q)  .  if  l  1  fB(k,q)  +  B(k,q)  .  .<k,q)  .  .<k,q)  ]  1 


(5.5) 


for  k,?*l,  ...,p;  J, )'  ■  1, ...,r. 

■nntftjMM  s.t.  If  the  assumptions  1,2  and  3  of  section  4  hold,  then  for  arbitrarily 


(y) 
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continuous  Qtf,,,...,!,,),  the  random  variable*  [^(T^  -  p^),  J*l,...,r, 

1p*1,  . .  .,p]  has  asymptotically  a  multinomial  distribution  with  a  null  wean 

fit  ol 

vector  and  a  dispersion  matrix  with  elements  defined  oy  (5.5). 

(It  my  be  noted  that  by  virtue  of  (2. 13),  (4.4)  end  (5.2),  the  above  multl- 
nonael  distribution  will  be  essentially  singular  with  a  rank  less  then  or  equal 
to  p(r-l).) 


PROOF.  We  shell  present  only  a  brief  sketch  of  the  proof,  as  the  sene  will 
follow  precisely  on  similar  lines  as  in  theorem  5.1  of  Puri  and  Sen  (1966)  and 
theorem  5.1  of  Sen  (1966c).  Proceeding  precisely  on  the  sane  line  as  in  the 
proofs  of  these  two  theorems  It  can  be  easily  shown  that 


V0’ 


(5.6) 


for  all  J*l, ...,r,  knl, ...,p,  where 


.(k)  +  _(k)  .1  l  f  1  *  f  (k)  „(k>.  _  (k)  ,„(k>  1 
BJ,1M  *  Bj,2H  r  *ml[  n  ^  l.BJ:j'(YlJ  >  BJ*:j<Yij.>  J 

Bj1|l<YiJ>)  "  /  [Fui(i)<X)  "  Ftj3(x)]  J,(k)(H(k)(x))  dP[(Jj(x)  ;  (5.8) 


P 


<k) 

[J](D 


(x)  - 


0,  if  x  <  Y*k) 
1,  if  x  ^  Y<k>, 


(5.9) 


for  i*l, .  ..,n,  j,  1*1, ...,r,  k“l, ...,p.  It  is  therefore  sufficient  to  show  that 

for  any  arbitrary  non-null  6  *  (6  6  ),  Z  T.  6..(B*k*  +  B4kL>  h*» 

.  11  Pr  j«l  J* 

asymptotically  a  normal  distribution.  By  virtue  of  (5.7),  the  same  can  be 
written  as  n^t^  •'Iir>'  where 
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.(kX^kX, 
Bj!jtYu  'J> 


i-i#. 


>n. 


(! 


Since,  the  random  variables  In  (5. 10)  are  independent  and  identically  distributed, 
in  order  to  make  use  of  the  central  limit  theorem  under  the  Llndeberg's  con¬ 
dition,  it  is  sufficient  to  show  that  these  have  finite  second  order  moments. 
Using  (3.8),  it  Is  easily  seen  that  E(S(|i:l, . . .,|ir) }  “  0  for  all  1*1, ...,n, 
and  by  virtue  of  (5. 10),  it  appears  to  be  sufficient  to  show  that 
»(|B^  (Y^|a)  <«  for  all  j,J»l, . . .,  r,  W,...,p,  i*»l,  ...,n.  Sow,  under  the 
assumption  3  of  section  4,  it  is  easily  seen  that  for  any  tj:  0  <  tj  <  S  (defined 
by  (4.8),) 


*(!»$  (i5>>|*')  <  -> 


(5.11) 


uniformly  in  J,J«1, . . .,r,  k-l,...,p.  Hence,  the  desired  asymptotic  normality 
follows  readily.  Asaln,  by  (5.7),  (5.8)  and  (5.9),  we  have 


where  8^, 
and  (5.4), 


■  r|(k).—(k)»  •(<()  ))  ■  K 

BtBj:l(Ylj  *  BJ«sJ'tYl*J*,)  5il» 


Is  the  usual  Kronecker  delta  and  (3 j j* ^ t 
*or  J,J»,J,i  •«!,..  .,r,  k,^>l,...,p.  Hence 


PJJ»sJi'  *  (3.12) 

•s  are  defined  by  (5.3) 

,  it  is  easily  seen  that 


*  +  *.%>> 


p(k,q) 

pji 


9 


(5. 13) 


which  is  defined  by  (5.5),  for  k,^rl,...,p,  J,i"l,...,r.  Consequently,  by 
(5.8),  we  may  conclude  that  the  dispersion  matrix  of  tha  asymptotic  normal 
distribution  has  elements  defined  by  (5.5). 

Hence,  the  theorem. 

He  have  already  noted  that  the  asymptotic  normal  distribution  of  theorem 
5.1  is  singular  and  of  rank  at  most  equal  to  p(r-l).  If  the  null  hypothesis 


.10) 
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in  (1.7)  is  true*  G(Y^j# . . .,Yf f)  will  be  a  symmetric  function  of  the  r  vectors, 

(k) 

and  hence  it  is  easily  seen  that  (i)  the  marginal  cdf  of  Y^'  will  be  the  same 
for  all  j"l,  .  ..,r,  l«l,  .  ,.,n,  and  is  denoted  by  H'*;(x)  for  k"l,  .  ..,p;  (11)  the 
marginal  cdf  of  Y^)  (l^*q)  will  not  depend  on  j,  and  la  denoted  by 

j(k»9)(j^y)  f0K  (iil)  the  marginal  cdf  of  (Y^,  y£^)  (j/f) 

will  not  depend  on  (J ft),  and  is  denoted  by  Hjk,^(x,y)  for  jj<i«l, .  ..,r, 
k,  qpl, .  ,.,p.  Thus,  it  follows  from  (5.3),  (5.4),  (4.11)  through  (4.14)  that 
in  this  case 


"  *kq-JJ*  "  “kq^  9  if  ■  >x> 

-  «  #J ‘-I,.-. ,r. 


(5.14) 


where  a^  depends  only  on  H^k,,^(x,y)  and  a£^  on  y),  respectively. 
Thus,  from  (4.15)  and  (5.14),  we  get  that  In  this  case  defined  by  (4.15), 
reduces  to 


Vk  -  [(r-l)/r](aJJ)  -  a***),  k,<rl,...,P,  (5.15) 


i(k,q) 


jl#  “  (®j|r_i)  Vkqi  k, q*l,  ...,p,  (5. 


where  Is  the  usual  Kronecker  delta.  Consequently,  it  is  easily  seen  that 
under  H0  in  (1.7), 


l  S  vkq  E  <T<k>  -H<k))(^q!  -Ji<q))  (5.17) 


(where  ((vkq))  is  the  reciprocal  of  ((v.  )),  and 

kq 
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|»<k)  -  /Vk)(u)du,  Vpl,...,p,) 

0 

hat  asymptotically  a  chi  square  distribution  with  p(r-l)  d.f.  Mow,  under 
assumption  2  of  section  4 

|**(i£k>  -ji<k^|  -  ©(1),  for  k»l,...,p,  (3.18) 

and  by  theorem  4.2,  we  have  under  assumption  3  that 

W  ^  u  *-•  >  il<fc> o. i9) 

Hence,  from  (3.13),  (3.17),  (5.18)  and  (3.19),  we  get  that  under  Hq  in  (1.7) 

SN  £  Sg.  (3.20) 

Hance,  we  arrive  at  the  following. 


THKOMH  3.2.  Under  in  (1.7)  and  assumptions  1  to  3  of  section  4.  the 
statistic  8^  in  (3.13)  hae  asymptotically  a  chi  square  distribution  with 
p(r-l)  d.f. 

Let  now  £  be  any  consistent  estimator  of  defined  by  (4.13)  and  (3.13). 
If  £  is  positive  definite  and  we  denote  its  reciprocal  by  »  ((yk<1)),  then 
we  can  have  an  asymptotically  distribution-free  test  based  on 

8  ■  n  £  l  $kq  E  (T<k)  -  *  ^q>>-  (5 

Since,  Sjj  can  be  shown  to  have  the  chi  square  distribution  with  p(r-l)  d.f., 
when  W0  in  (1.7)  holds,  the  test  function  may  be  proposed  as 


(3.21) 


i,  if  sN  >  xap<ri>1^s 


0,  otherwise. 


(3.22) 
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We  ehall  now  consider  the  power  properties  of  the  permutation  test  in  (3.17) 
and  (4.31)  and  the  large  sample  teat  in  (5.22).  We  shall  obtain  certain  power* 
equivalence  relations  among  theie  teats,  and  compare  them  with  the  parametric 
tests  <;.'f erred  to  in  Section  one. 

By  virtue  of  theorem  5.1,  it  can  be  shown  that  if  the  linear  model  (1.5) 
holds  but  the  null  hypothesis  (1.7)  is  not  true,  then  (p^  -  5^),  j»l, ...,r, 

Jp*i, can  not  all  converge  to  zero  as  N->»,  and  hence,  S^,  defined  by  (3.15), 
will  be  stochastically  indefinitely  large,  as  N  increases.  Consequently,  the  tests 
considered  will  be  all  consistent.  2hus,  for  any  given  (r^, ...,;$)  in  (1.6), 

(not  all  null),  the  power  of  the  test  (3.17)  or  (4.31)  or  (5.22)  will  be  asymp- 
totically  equal  to  unity.  Hence,  forthe  study  of  the  asymptotic  power  properties 
of  the  tests,  we  shall  consider  a  sequence  of  alternative  hypotheses  for  which 


the  power  asymptotically  lies  in  the  open  interval  (e, 1).  This  we  specify  as 


hh!  Zj  "  kj' 


(5.23) 


where  Xj,  J*l, ...,r  are  all  real  p-vectors,  not  all  equal  (or  null).  Further, 
for  simplification  of  the  asymptotic  power  function,  we  shall  assume  that  the  cdf 
F$(x),  and  *r*  absolutely  continuous  and  have  con¬ 

tinuous  density  functions.  Under  (H^)  in  (5.23),  we  will  thus  have  sequences  of 
cdf'a  jj(x))  ate,  defined  for  each  N,  and  it  is  easy  to  verify  that 


lim  -(b) 


*$,„(*)  -  H<k)(x)  for  all  J-i,...,r, 


(5.24) 


JIj  F[  j](x»y)  ■  H*k,q)(x,y)  for  all  )-l,...,r,  k/q-l, , . p 

(5.25) 

f[ j*l](*»y)  “  H^k,q\x,y)  for  j/i-l,...,r,  k,<rl,...,p.  (5.26) 
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Hence,  In  this  cast  also  (3.16)  hold*  in  the  limit  as  N->®.  Also,  If  we  define 

cfc  -  /  4:  J<k)<H(x>(x»  dF<k)(x),  1^1,. ...p,  (3.27) 

r  SB 

then,  it  It  atty  to  show  that 


(3.28) 


for  all  $*1,.  ..,r,  fe*l,  ...,p.  Hence,  from  the  results  of  theorem  5.1  it  fol¬ 
lows  that  under  (H^),  8*  has  asymptotically  a  noncentral  chi  square  distribu¬ 
tion  with  p(r-l)  d.f.  and  the  noncentrality  parameter 

l  l  vkq  L?  (7  E  (5 

*  le-1  <pl  *  p  r  J-l  3  J 

where 

^jqVr»  for  9"1# •••»?• 

Mow,  from  theorem  A. 2,  (5.24),  (5.23),  (5.26)  and  the  discussion  following  it, 
it  follows  that  under  (Hj|)  alao  J§  Bg,  and  hence,  we  have  the  following. 


THEOREM  3.3.  Under  the  sequence  of  alternatives  [H^)  in  (5.23),  S^,  defined 
by  (3.13),  has  asymptotically  a  non-central  chi  square  distribution  with  p(r-l) 
d.f.  and  the  non-centrality  parameter  defined  by  (5.29),  provided  the  con¬ 
ditions  of  theorem  5.1  hold,  and  in  addition,  the  marginal  cdf's  corresponding 
to  the  Joint  cdf  0(1^, . . Jir)  are  all  absolutely  continuous  and  have  continuous 
density  functions, 

Xf  we  consider  the  large  sample  test,  defined  by  (5.22),  then  it  can  be 
1  Sg,  under 

5.3  alao  applies  to  8^.  Thus,  the  permutation  test  considered  in  sections  3 
and  4  and  the  large  sample  test  considsred  in  (5.29),  are  asymptotically  power 


(Hg),  and  hsnes,  the  conclusions  of  theorem 


shown  similarly  that 
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equivalent  for  the  sequence  of  alternative*  (Hg),  in  (5.23).  As  we  have  seen  that 
the  permutation  teata  are  easy  to  define  for  amall  samples,  we  are  now  in  a  position 
to  recommend  the  use  of  the  same,  for  all  sample  sizes. 

In  the  parametric  case,  the  limiting  distributions  of  various  test-statistics 
for  this  problem  have  bean  studied  by  various  workers,  and  the  reader  may  ba 
referred  to  Anderson  (1938,  Ch.  8*10),  Rao  [(1932,  Ch.  7),  (1963,  Ch,  8)],  and 
James  (1960),  among  others.  Host  of  the  results  relate  to  the  null  case,  while 
it  may  be  considerably  difficult  to  formulate  a  general  theory  for  the  non-null 
cases,  though  some  work  has  also  been  done  on  this  line.  For  the  likelihood  ratio 
test,  however,  the  asymptotic  non-null  distribution  may  be  found  without  much 
difficulty,  and  for  the  sequence  of  alternatives  in  (3.23),  this  statistic  can 
be  shown  to  have  asymptotically  a  non-central  chi  square  distribution  with 
p(r-l)  d.f.  and  the  non -centrality  parameter 

£  £  okq  {“  £  (X<k>  -T(k))(x5q)  -  X(q>)},  (5.30) 

u  kel  <pl  r  J-l  J  J 

where  Xjk^  and  are  defined  by  (5.23)  and  (3.29),  respectively,  and 
•1  ka  -1 

£  ■  ((a  H))  "  is  the  reciprocal  of  the  common  dispersion  matrix  £. 

The  comparison  of  Ag  and  Ay  (for  the  purpose  of  studying  asymptotic  relative 
efficiency)  poaes  the  same  problem  as  has  been  studied  in  some  detail  by  Purl 
end  8en  (1966).  For  intended  brevity,  this  is  therefore  not  reproduced  again. 

The  only  remark  that  may  be  made  here  is  that  if  we  work  with  g^»s  (defined 
by  (2.9),  (2.10),)  as  the  expected  values  of  the  order  statistics  in  a  sample 
of  site  N  drawn  from  a  standardized  normal  distribution  and  term  the  resulting 
test  es  Normal  score  MAKOVA  test  for  the  two  way  lay  out,  then  it  is  easily  seen 
that  for  normal  alternatives,  this  test  is  asymptotically  power  equivalent  to  the 
likelihood  ratio  test.  Zn  actual  practice,  the  use  of  rank  sums  (i.e.,  E^k^  ■  a/(N+l), 
,M,  kwl, ...,p)  often  results  in  a  quite  simplified  procedure  and  at  the 


I 


sane  time  does  not  involve  any  aerlous  loaa  of  efficiency.  For  detella  of  these 

points,  the  reader  may  be  referred  to  ruri  and  Sen  (1966),  the  (erne  argument 

being  true  In  the  two  way  layout  case. 
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T FISTS  FOR  OTTTT,TFRS>:< 

H.  A.  David 

University  oi  North  Carolina  at  Chapel  Hill 
North  Carolina 

1.  INTRODUCTION.  The  proper  treatment  of  outliers  has  long 
been  a  subject  for  study.  It  is  an  active  area  now  and  likely  to  remain 
so  for  some  time  to  come.  The  reason  for  this  is  easy  to  see;  there 
seems  no  limit  to  the  multitude  of  different  situations  in  which  outliers 
are  important.  Excellent  recent  surveys  of  the  subject  have  been  given 
by  Dixon  (1962)  and  Ferguson  (1961a). 

It  is  useful  to  distinguish  three  aims  of  procedures  designed  to  deal 
with  outliers; 

(a)  to  screen  data  in  routine  fashion  preparatory  to  analysis 
(this  includes  but  is  more  general  than  the  old  problem  of 
'rejection  of  outliers'); 

(b)  to  sound  an  alarm  that  outliers  are  present,  thus  indicating 
the  need  for  closer  study  of  the  data-generating  process; 

(c)  to  pinpoint  observations  which  may  be  of  special  interest 
just  because  they  are  extreme. 

Numerous  test- statistics  have  been  devised,  mostly  from  intuitive 
considerations,  and  their  percentage  points  tabulated  on  the  assumption 
of  a  common  normal  parent  population.  However,  much  more  needs  to  be 
known  about  the  performance  of  thi  various  statistics  in  use  for  the  non¬ 
null  situation  when  outliers  are  in  fact  present.  We  will  here  be  concerned 
primarily  with  cases  (b)  and  (c).  This  is  not  in  any  way  to  belittle  the 
importance  of  case  (a),  and  I  will  just  mention  a  recent  proposal  by 
Anscombe  (1966).  If  the  primary  aim  of  screening  data  is  the  estimation 
of  parameters,  Anscombe  suggests  a  two-stage  procedure:  (l)  Apply 
the  appropriate  test  for  outliers  at  a  very  stringent  level  of  significance, 
so  stringent  that  good  observations  will  very  seldom  be  rejected.  The 
purpose  of  this  ia  to  get  rid  only  of  wild  observations  very  far  removed 
from  the  main  stream.  (2)  Apply  the  same  outlier  test  again  to  the  reduced 
data  but  now  at  quite  a  moderate  level  of  significance.  This  time,  unlike 
the  preceding  3tage ,  observations  found  to  be  outlying  will  not  be  rejected 

’''Research  supported  by  the  Army  Research  Office,  Durham. 


but  rather  given  reduced  weight  in  the  estimation  of  parameters.  This 
second  stage  process  is  commonly  termed  Winaoriaatlnn 

This  kind  of  approach  promises  to  be  fruitful  for  the  situation  of  case 
(a)  although  its  properties  are  by  no  means  easy  to  investigate. 

We  shall  begin  with  a  discussion  of  several  measures  of  performance, 
including  the  power  function,  of  some  well-known  test  statistics  relevant 
to  cases  (b)  and  (c).  We  assume  that  the  underlying  variation  is  normal 
and  consider  in  some  detail  the  case  where  a  single  true  outlier  is  present 
which  differs  from  the  remaining  observations  in  mean  only.  Some  limited 
results  will  also  be  given  for  the  case  when  two  observations  are  from  a 
common  outlying  or  contaminating  distribution.  The  situation  of  an  unknown 
number  of  outliers  is  briefly  treated.  Some  of  the  statistics  we  use,  and 
others,  have  been  studied  under  these  assumptions  by  experimental 
sampling. 

2.  MEASURES  OF  PERFORMANCE.  Let  x  (1*1,  2 . n)  be 

*  2 

independent  normal  variates,  having  mean  p  ^  and  variance  <r  .  On  the 

null  hypothesis  of  homogeneity,  HQ,  the  p^  are  all  equal  to  some  unspecified 
value  p  .  We  shall  consider  alternatives  Ha  representing  a  shift  or 
slippage  to  the  right  in  one  or  a  small  fraction  of  the  p£.  A  suitable  class 
of  statistics  for  testing  HQ  against  Ha  is  of  the  form 

(1)  v  -  max  d.  , 

i 


where  d^  is  the  difference,  x-  -  x  ,  appropriately  divided.  Of  particular 
interest  are  the  following  special  cases  of  v  corresponding  to  various  degrees 
of  information  on  cr : 

(i)  standardized  extreme  deviate  (from  the  sample  mean) 

v.  =  max(x.  -  x)  /<r  =  (x  ,  -  x)  /  «r  ; 

1  *  l  '  '  max  ' 

(ii)  internally  studentized  extreme  deviate 

v2  =  (xmax  "  /  ••  *2  *  s(xi  ‘  *)V  (n  ‘  l)> 

(ill)  externally  studentized  extreme  deviate 


v,  =  (x  -  x)/s 
3  '  max  "  v 


Vj  is  appropriate  when  tr  ia  known,  in  the  absence  of  any  knowl¬ 
edge  of  a  .  The  use  of  and  requires  an  independent  estimate  of  <r  . 

In  v.  such  external  information  is  combined  with  internal  information  by 
4  2 

means  of  a  pooled  estimate  of  j  .  Formally  Vj  and  may  be  regarded 
as  the  special  cases,  v  =  ®  and  v  =  G,  of  v^. 

If  v&  is  the  upper  a  significance  point  of  the  null  distribution  of  v, 

then  Hq  is  rejected  for  v  >  v  ,  and  the  warning  required  in  case  (b)  of  the 

Introduction  is  thereby  given.  For  (c)  this  must  be  followed  up  by  declar¬ 
ing  one  or  more  of  the  xi  to  be  outliers,  for  example,  those  Xj,  for  which 
di  exceeds  v. 

Because  of  the  difficulty  of  dealing  with  more  general  alternatives  we 
shall  first  suppose  that  just  one  of  the  observations  —  we  do  not  know 
which  —  is  a  true  outlier  and  has  mean  p  +  A(A  >  0).  In  the  formulation 
of  slippage  tests  we  may  say  that  Ha  consists  of  n  mutually  exclusive 
hypotheses  of  which  the  ith,  Hp  specifies  that 


Pj  =  P  +  X .  Pj  =  P  (j  =  1 .  2 ,  ....  i  -  1 ,  i  +  1 ,  . .  .  ,  n)  . 


It  is  known  (e.  g.  Kudo,  1956)  that  in  this  situation  v^  (and  hence  Vj,  v ^  when 

applicable)  has  the  desirable  optimal  property  of  maximizing  the  probability 
of  rejecting  a  true  outlier  in  the  class  of  all  level  a  tests  which  are 
invariant  under  the  transformation  x!  =  ax.  +  b(a  >  0)  applied  to  each  x^. 

It  is  clear  that  a  reasonable  measure  of  the  performance  of  any  of  the 
v-statistics  can  depend  only  on  the  sample  size  n  and  the  ratio  A /<r  .  In 
particular,  the  measure  must  be  Independent  of  which  of  the  holds.  For 
convenience  we  therefore  take  i  -  1,  and  also  <r  =  1.  The  following  measures 
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come  to  mind : 

1.  Power  function  P5  =  Pr(v  >  vn  J  Ptj ) . 

2.  Probability  that  the  observation  from  the  slipped  population 
is  significantly  large 

vJHi>- 

3.  Probability  that  x^  is  significantly  large  and  the  largest  in  the 
sample 

P3  *  P'(‘il  *  V  *1**2'  x3 . XJH1>' 

4.  Probability  that  only  is  significant 

P  =  Pr(d  >  v  ,  d  ,  d . d  <  v  [H  ). 

4  *  1  a  2  3  n  a 1  1 

5.  (Dixon,  1950)  Probability  that  x^  is  significantly  large  given 
that  it  is  the  largest  in  the  sample 

P5  aPr{dl>vo!xi  >x2 . xn:  Hl)* 

We  see  that 

<2>  P,iP2  i  P3  1  P4 

and  also  that 

(3)  P5  =  P3/Pr(x1  >  x2<  x3>  . .  .  ,  xn)  , 

where  the  probability  in  the  denominator  has  been  tabulated  by  Teichroew 
(1955)  for  n  <  10. 

It  can  be  shown  that 

?2  <  ?!  <  ?2  +  *  • 

% 

provided  n  <  2/a  ;  in  fact,  a  somewhat  stronger  general  inequality  holds 
(David  and  Paulson,  1965).  Also  for  v_  one  has  P,  =  P  =  P  ,  We  there- 

fore  confine  attention  to  P^  as  the  most  convenient  measure. 
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The  graphs  of  Figure  1  show  inter  alia  just  how  much  is  added  to  the 
value  of  P?  by  the  use  of  rather  than  v,  in  the  present  case  of  a 

single  true  outlier.  Of  course,  the  gain  is  highest  when  the  internal  infor¬ 
mation  on  cr^  is  large  compared  to  the  external  information,  i.e.  when 
n  -  1  is  large  compared  to  v.  However,  there  are  indications  that  internal 
degrees  of  freedom  are  less  valuable  than  external  ones.  Thus  for  n  *  6 
the  solid  curve  v  =  5  lies  well  above  the  dotted  curve  v  =  0  although  in 
both  cases  there  is  a  total  of  5  D.  F. 


3.  A  SEQUENTIAL  PROCEDURE.  It  will  be  a  rare  occasion  when  we 
actually  know  the  number  of  outliers  for  which  to  test.  Ideally  we  might 
wish  to  proceed  sequentially  as  follows; 

Apply  a  certain  test -statistic  to  the  sample  of  n.  If  significance 
is  obtained  eliminate  the  most  extreme  observation  and  apply 
the  same  test -statistic  to  the  reduced  sample  of  n-1,  adjusting 
the  significance  point  to  the  new  sample  size.  If  significance 
holds  again,  repeat  the  procedure  until  the  test-statistic  ceases 
to  be  significantly  large. 

We  consider  now  such  a  sequential  procedure  for  v^,  the  case  where 

<r  is  known  and  may  be  taken  equal  to  unity.  To  this  end  note  the  following 
easily  proved  algebraic  results; 

(a) 


(b)  x.  -  x,  >  x.  -  x,  according  as  x  -  x  >  x.  -  x  . 

'  ij<ji  8  i  <j 

(c)  x.  -  x  +  - r-  (x.  -  x)  =  x.  -  x.  . 

*  i  n-1  '  j  '  i  j 

Also  when  the  x.'s  are  normally  distributed. 


xjL  -  x  = 


n-1 

n 


-  *i)- 


where  x^  = 


n 

£ 

t=l 

t^i 


xt/(n-l). 


(d)  xA  -  x  and  Xj  •  x^  are  statistically  independent. 

In) 

(e)  v'  '  ,  the  upper  a  significance  point  of  v.  in  samples  ofn, 

i  »a  l 

is  an  increasing  function  of  n. 
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From  (c)  and  (d)  we  see  that  the  joint  occurrence  of 


(4) 


-  (n)  -  (n) 

X  -  X  >  V*  X  “  X  >  V* 

i  1,0  j  l,a 


implies 

(f)  >  v^r  , 


and  by  symmetry  that 


(n-1) 
v\  '  . 
1,0 


This  result  means  that  we  do  not  have  to  take  the  above  procedure  too 
literally:  if  (4)  holds  we  can  immediately  declare  both  x^  and  x^  to  be 

outliers,  and  next  apply  our  test-statistic  to  the  remaining  sample  of  n-2, 
etc. 


To  evaluate  the  performance  of  this  procedure  we  consider  a  rather 

special  case:  two  observations  x.  and  x.  (i,  j  unknown)  are  from  a 

'  J 

contaminating  N(p  +  A,  1)  (A  >  0)  population,  the  remaining  n-2  are 
from  N(p,  1).  This  is  a  reasonable  model  for  the  situation  when  a  common 
source  is  responsible  for  the  shift  in  the  two  observations.  Any  accept¬ 
able  measure  of  performance  will  not  depend  on  i  and  j  which  we  take 
to  be  1  and  2.  We  consider  the  following  measures; 

1.  Probability  that  at  least  one  of  x^,  is  significantly  large: 

II.  =  Pr  {max(x  -  x,  x  -  x)  >  v(*^  }  . 

I  14  1  1 01 

2.  Probability  that  both  x.  ,  x  are  significant  in  a  2-stage  proce¬ 
dure: 

n  =  Pr{max(x.  -  x,  x,  -  x)  >  v ■,  ,  min(x  -  x  ,  x  -  x  )  >v^n~^}. 

3.  Probability  that  both  x^,  are  significant  at  the  first  stage; 

II,  =  Pr{x  -  x  >  >>  ,  x,  -  x  >  v^  }  . 

(In  these  measures  we  are  not  concerned  with  the  possibility  that  good 
observations  may  also  be  declared  outliers. )  It  is  clear  that  n.  >  n.  >  II,. 
Ilg  may  be  found  from 
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n,  =  Pr{x  -  H.  >  v(,n)  ,  x  -  X  >  >'l)  } 
2  1  l,a  2  1  l,a 


-  .  (n) 


+  Pr{x2-x  >vY;a,  -i2>  vJ1B‘ftl)> 

-  Pr{x.  -x  >  vjn>  x  -  x.  >  v\n'l\  x?  -  x  >  v(n)  ,  x.  -  x  >  v|n'1^  } 

1  l>a  2  1  l,o  2  l,a  1  2  l.o  J 

*  2Pr{x.  -  x  >  v(n) }  Pr{x,  -  x.  > 

1  1,0  C  1  1,0 

-  Pr{Xj  -  x  >  yW.  x2  -  x  >  v|n^}  by  (d)  and  (f)  . 


Hence  as  well  as  11^  and  11^  can  be  evaluated  from  tables  of  the 

univariate  and  bivariate  normal  distribution  function.  Figure  2  gives 
some  numerical  results  comparison  being  also  made  with  the  earlier 
probability  (P2  for  v  »  m)  of  detecting  a  single  outlier  when  only  one  is 

present.  The  difference  between  II  and  II,  is  seen  to  become  less  marked 
as  n  increases. 

Some  extensions  of  these  results  to  the  above  cases  of  e r  unknown  are 
planned.  It  must  not  be  supposed  that  the  results  will  all  be  much  the  same. 
When  a-  has  to  be  estimated  from  the  sample  at  hand  the  presence  of  a 
second  outlier  tends  to  "mask"  (Murphy,  1951)  the  effect  of  the  first.  In 
fact,  for  a  =  .05  and  n  <  14,  the  probability  of  detecting  any  outliers  by 
the  uee  of  v2  tends  to  zero  as  \  -*  ».  (cf.  Ferguson,  1961b).  For 

finite  \  the  probability  of  detection  may  be  quite  unsatisfactorily  low  and 
the  sequential  process  has  little  chance  of  ever  getting  started.  The  mask- 

ing  effect  applies  also  to  other  statistics  such  as  Dixon's  r.n  s— »-* - »  ' 

10  X(n)  '  X(l) 

which  might  be  used  sequentially  in  this  case,  Ferguson  (1961b)  recom¬ 
mends  Karl  Pearson's 

b2  “  nS(xi  "  x)4/[2(xi  -  x)2]  2 


as  a  general  statistic  appropriate  for  both  one  and  two-sided  tests. 

It  should  also  be  noted  that  in  the  artificial  case  where  the  above 
model  of  exactly  two  outliers  is  known  to  be  the  right  alternative  to  H0 
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the  optimal  procedure  consists  (Murghy)  in  rejecting  the  largest  two 

observations  when  (x,  ,  +  x,  . .  -  2x)/a  Percentage  points 

-  in;  in-x; 

are  not  known  but  are  available  for 

n-2  _  n  . 

S  1  J/  £  ^xi  ‘  x)  Grubbs  (1950). 

i«l  W  n’i,n  isl  1 


Dixon  (1951,  1962)  gives  percentage  points  for  several  of  his  r- statistics, 
e.g.  for 


20 


x,  .  -  x. 

-.(s?  1  fc,:.2). 

x(n)  ‘  x(l) 


designed  at  a  test  "for  avoiding 

Although  only  a  fraction  of  the  many  questions  of  interest  have  been 
considered  in  this  paper  I  hope  that  the  need  for  much  more  detailed 
knowledge  of  the  performance  of  tests  for  outliers  has  been  demonstrated. 
Of  course,  it  must  never  be  forgotten  that  the  problem  of  outliers  is  only 
partly  statistical. 

Section  2  of  this  paper  is  based  on  David  and  Paulson  (196*5)  where 
further  details  are  given.  I  am  indebted  to  R.  G.  McMillan  for  Figure  2. 
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ABSTRACT.  This  report  deals  with  the  analysis  of  a  model  for 
studying  the  probability  of  survival  of  a  subterranean  target  under  an 
intensive  attack.  Most  of  the  analysis  is  baaed  on  the  assumption  that 
the  explosions  are  circularly  distributed  about  the  target  and  that  the 
number  of  explosions  is  known.  In  the  last  two  sections  it  is  shown 
what  effect  a  relaxation  of  these  assumptions  has  on  the  probability  of 
survival  of  the  target. 


The  rest  of  this  artical  was  reproduced  photographically.  It  is  MRC 
Technical  Summary  Report  No.  653,  December  1966, 


1.  Introduction 


Tula  tepoix  ueais  with  the  analysis  oi  a  model  tor  studying  the  probability 
of  survival  of  a  subterranean  target  under  an  Intensive  attack. 

The  target  is  located  below  the  surface  at  u  distance  d  u'om  the 
surface.  The  projection  of  the  target  on  the  surface  will  be  identified  as 
the  origin  in  ordinary  two-dimensional  rectangular  coordinates.  K  explosions 
occur  at  points  X  ,5(  _,..., X  ,  where  X.  are  independent  identically  dis- 
tributed  random  vectors,  X^  =  (X  j,X  2)  •  They  will  be  assumed  to  have  the  bi¬ 

variate  normal  distribution  centered  at  the  origin  with  zero  correlation  coefficient, 
i.  e. , 

2  2 
X  X 

(  1)  f(xJ(x2)  =(  21^0-2) exp(  +  “f  })  ( -“<x1,x2<=e  jcTj,  cr2  >  0)  . 

°1  *2 

The  energy  directly  applied  into  the  ground  will  be  denoted  by  E  and  the 
seismic  velocity  of  the  rock  will  be  denoted  by  c  .  The  distances  R^, 
i  =  1,  2,  . ,  K,  of  the  explosions  from  the  origin  are  consequently  independent 
identically  distributed  random  variables,  and  from  (1),  their  common  probability 
density  function  is  given  by 
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1 


|2)  ¥r)  ■j^/“*-4{i^  +  iSi-a>>d0  ■ 

12  0  or.  <r_ 


In  particular,  if  a,  -  tr„  =  <r,  then 

U) 


i  2  /  _  2 

,  .  .  -2  -r  /  2<r 

fj(  r)  =  rcr  e 


(  o  <  r  <  oo )  , 


It  will  be  assumed  that  the  free  field  stresses  P^,  i  =  1,  2, . . , ,  K,  due  to 
the  explosions  are  given  by  * 


(4) 


a  3  2  2  -  v 

Vxc  E  <Ri  +  d  >  v 


i  »  1,2 . K 


where  \,  a,  p.  and  y  are  positive  parameters.  Therefore  P,P . P  are 

i  J\ 

independent  and  identically  distributed  random  variables. 

The  following  assumptions  will  be  made  about  the  survivability  of  the  target. 

(1)  If  max  P  >M,  the  target  will  fail.  That  is  to  say,  M  is  the 

1  <  i  <  K 

maximum  loading  from  a  single  burst  which  the  target  can  withstand  without 
failure.  * 

(2)  If  P^Pg,  no  damage  to  the  target  takes  place  from  the  i  burst. 
Further,  if  P  >  pQ,  some  permanent  damage  is  done  to  the  target,  in  an  amount 
proportional  to  P  -  pQ  •  pQ  is  the  elastic  limit  of  the  target  structure.  Thus 


we  define 


(5) 


11  Vo 


otherwise  . 


(  i  =  1,  2  ,  ■  •  • ,  K) 


and  Di  is  known  as  the  degradation  due  to  the  ith  burst.  The1  target  will  also 


fail  to  survive  the  K  explosions  whenever 

K 

(6) 


Ed1>d-  . 

i=l 
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»■'  * 


i 


ill 

Here  D  is  called  the  maximum  allowable  cumulative  degradation.  It  is  assumed 
that  the  accumulation  o f  permanent  damage  is  additive  and  has  no  effect  on  the 
amount  of  damage  produced  by  any  subsequent  explosion,  or  on  M,  the  maximum 
loading  from  a  single  burst  whioh  the  target  can  sustain. 

Thus  we  have  that  the  target  will  survive  K  explosions  whenever 
K  * 

(7)  E  D.  <D  and  max  P  <  M  . 

1=1  1  <  i  <  K 

* 

The  following  relations  between  pQ,  M,  and  D  will  be  assumed  to  hold, 

pQ  <  M  <  D*  +  pQ  . 

Minor  modifications  in  the  analysis  that  follows  would  be  needed,  if  this  were 

not  the  case.  However,  it  is  clear  that  these  are  consistency  requirements 

which  should  reasonably  be  satisfied  by  the  three  parameters  given  above. 

In  Section  2,  we  obtain  the  probability  density  function  of  the  free  field 

2  2 

stress  due  to  a  single  explosion,  when  =  a  ^  .  This  will  be  exploited  in 
Section  5,  by  exhibiting  a  number  of  examples  to  show  how  <.  straightforward 
examination  of  this  probability  density  function  may  be  employed  in  estimating 
the  probability  of  survival. 


Section  3  contains  a  discussion  of  techniques  for  estimating  the  probability 

2  2 

of  survival  when  K  is  fixed  ( 1.  e. ,  not  a  random  variable) ,  and  when  <r  ^  =  cr  2 
(the  circular  bivariate  normal  distribution) .  The  approximation  methods  used 


Section  ‘6  discusses  some  methods  which  may  be  employed  if  ,'f  *  *  2  (the 
elliptic  case).  These  are  comoared  with  results  obtained  in  Section  3  for 

l  2  \  V  •  ■ 


I  t  ’  :  t.  V.  •• 

*  l  *"  M  -  v  ,  .  .  .  ■  .V,  ’  .  . 

Finally,  Section  7  provides  a  brief  disoussion  of  the  extension 


e  pre¬ 


vious  results,  if  K  is  a  random  variable,  rather  than  a  fixed  quantity. 


2.  The  Probability  Distribution  of  the  Free  Field  Stress  of  a  single  Explosion 
A  substantial  amount  of  useful  information  may  be  obtained  by  a  careful  ex¬ 
amination  of  the  probability  density  function  of  P,  the  free  field  stress.  We 
will  derive  this  function  in  this  section,  and  note  some  of  its  properties.  These 
will  be  exploited  in  Seotion  5  of  this  report. 

It  will  be  convenient  to  define 

(8)  e*Xc*EP  .  *' 

Thus,  from  ( 4) ,  we  have 

(9)  '  P-6<R2  +  dVV  (0<R<«)  . 

P  =  P(R)  ,1s  a  mapping  from  [0,  so)  to  (0,ed"2yJ  .  On  [0,«o),  p  is  a  mono¬ 
tonic  decreasing  function  of  R,  and  thus  the  inverse  mapping  p"\  p)  is  unique¬ 
ly  defined  for  every  p,  0  <  p  <  6d  2  v  ,  and  Is  a  posltiva  monotonio  decreasing 
function  of  p%.  Indeed 

L 

(10)  p-1(p)  =[(e/P)1/v-d2)2  (o<p<e<f2v)  . 


Henoe 

(ID 


Pr{P  <  p}  -  Pr{R>  P“l(p)} 

*A»  pOO 

rn  I  f j( r) dr  (0<p£8d*2y) 

-1*' 

P  ( P)  ' 

\f»  7 


JL 


where  f^(  r)  is  given  by  (2)  or  (  3)  and  P  ( p)  is  given  by  ( 10}  . 

We  will  restrict  ourselves  to  the  case  <r  ^  =  <r  ^  until  Section  6.  With  this 
restriction,  f^(  r)  is  given  by  (  3) ,  and  integration  of  ( 11)  yields 


^/Y  ,2, 


(P  <  0) 


(12)  .  Pr{P<p}  ■  G(p)  =  )  e-{{e/p)  -d  >/2cr  (o  <  P  <  ed_2Y)  . 

Li  (p  >ed“2v  ) 


Then,  the  probability  density  function  of  P  is  given  by 

(13)  g(p)  M2Y)"1  o-“2el/vp'(v+1)/v  exp{-[(0/P)l/Y-d2]/2<r2}  (O<p<0d"2v) 

We  now  proceed  to  investigate  some  of  the  characteristics  of  g(  p)  . 

The  median  Mp  of  g(  p)  is  readily  obtained  by  solving 


Hence 


-  {(e/Mp)1^-  d2}/2ir  2 


G(Mp)  a  g 


{(rf-)l/Y-dZ}/2«r2al°g2  . 
P 


Mp  =  0(  2<r  2  log  2  +  d2)"V 


Here  <rv21og2  is  frequently  referred  to  as  the  CEP  (  circular  error  probability) , 
so  that  we  may  also  write 

Mp  *  B[(CEP)2  +  d2]"v 

Similarly,  the  vth  percentile  of  P  may  be  obtained  by  setting  the  right  hand 
side  of  ( 14)  equal  to  v/100  . 

We  can  find  the  mode  of  g(  p) ,  which  we  denote  by  mp,  by  solving 
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(16) 


or 


+ 


2Y  <rC 


m 


V  Y+i)  /  Y 
P 


n 


(17)  ^  mp  =6[2(y+1)]“Y  <r"2v  . 

Thus,  since 

q(  0)  *  o,  «( ed“2Y  ) » ( 2v)-1  tr“2  e _1  d2(  Y+1)  >  o 


g(  p)  has  a  unique  mode  given  by  ( 17) ,  whenever 

m  p  <  fid 

or  equivalently, 

(18)  d  <  r  2(  Y+l)  ]1/2  0-  • 

1/2 

If,  on  the  other  hand,  c1>[2(y+1)]  <r,  then  g(p)  is  monotone  increasing,  and 

m.  2  \J 

the  maximum  of  g(p)  occurs  at  6d  . 


We  conclude  the  characterization  of  g(p)  by  evaluating. the  moments  (both 

conditional  and  unconditional) .  Let  A  be  any  measurable  set  on  { «)  ;  then 
th 

the  conditional  k  moment  of  g(  p) ,  ^  ^  ^  is  given  by 

,19)  **k,A  "E{pl4'P*  Ai 

-  E{ek{R2 +d2)  “Yk  |R.  P"1(A)) 


E 


Pr{R  <  P_1(A)} 


f  trV,-'k 

P-I(A) 


which  is  obtained  using  (3)  and  (  9) . 

Two  particular  cases  of  ( 19)  merit  explicit  statement. 

(1)  If  A  is  an  interval  (PlfP2)  with  0  <  pt  <  p2  £  8d"2y  ,  then 
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00  -1 

."T(a»  x>  =  /  e  .*  t*  dt ' 

■  ■  ■  x  • 

is  the  incomplete  gamma-function. 

(  2)  if  pj  ■  o,  p2  =  0d’2v  ,  then  *  0,  r2  end  we  obtain  the  un¬ 
conditional  kth  moment '  • 

(24)  !(■*).,.  iVll,, 

K  ( 2<r)  YK 


3.  Estimation  the  Probability  of  Survival 

We  will  provide  two  formulas  for  estimating  the  probability  of  survival.  The 
first  {  27)  is  more  accurate,  but  substantially  more  difficult  to  compute.  The 
second  (  30)  should  nevertheless  provide  a  good  approximation  for  large  K  . 

Both  approximations  employ  the  central  limit  theorem  of  probability  theory. 

Let  T  be  the  event  described  by  ( 7) .  Then  Pr{T}  la  the  probability  of 
survival.  We  may  write  the  event  T  as  follows: 

K  * 

T  «  U  { m  of  P.,P  ,  ...,P  >p.,  h  P,  <D  +mp0,  max  P  <M>  . 
m«°  12  K  0  i  0  llllK  1 


Thus 

Pr{T}  -  Z  OPr*Pl,P2 . Pm>P0»  Pm+l'Pm+2',,,,PK<P0»  ^.Pi-D  +mp0 

m«>0  i»l 

max  P,  <  M  } 

1<  i  <  K 

•  f  Pi-D,"+mpo'p0<Pi.  ••  •  P  <M)  e'(K“m)f(e/pO)1/,V-d2]/2(r2 

m*0  m  l-l  1  12  "> 

XPr{p0<P1,P2,...,Pm  <M}  . 
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The  last  factor  Is  evaluated  as  follows; 


Pr  {p0  <  p!’  p2»  <  Ml  •  ■:  -A  -  ;■ 

-[(6/M)l/v  -d2]/2<r2  '  -T(0/PJ1/V  -d2l/2o-2  •  ,1 

{e  •  -e  0 


(P0  <  ed“2v<  M) 
(ed"2v< p  ) 


To  oomplete  the  approximation,  we  estimate 

m  * 

(26)  Pr { 2  PA  <  D  +  m‘;0IPo  <  Pl,P2'**,,Pm  < 

1  i 

-  2v 

by  means  of  the  central  limit  theorem.  Noting  that  if  M  >  0d  ,  M  plays  no 

role  in  conditioning,  we  replace  (26)  by 
m  ^ 

Pr  {  X)  PtiD  +  mp0lp0  <  P1,P2,...,Pm  <min(M,0d“2^  )}  . 
i=l 

“  2  v 

Then,  setting  p1  =  pQ,  p2  *  min{  M,  0d  T)  and  A*(PltP2),  we  can  obtain 
pj  A  and  p2  A  from  ( 20)  and  (  23) ,  Finally,  the  central  limit  approximation  to 
{  26)  is  given  by 

I?  *  / D^  +  mPrt -mp.  .  \ 

Dai  V  O  >  «  i  .  I  _  ^  n  n  n  —  1  _  I  _ Q _ 1  •  A  I 


fl}  *  /D+m Pf.-mn.  . 

l  PilD  +mP0|po  <Pl»P2»’*‘*Pm  <P2}  'Vr - =4 

lB  '^m(>i2,A“(,Al,A)  * 


where 


« ( x)  □  -dbr  / 

\l2ir  -» 


X  -tZ/2 
e  w  dt  . 


Thus  we  have 
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We  now  extend  { 28)  to  obtain  the  conditional  moments  of  D  conditioned  on 
{PA<M,  M>p0)  .  Clearly, 

E{DklP  <  M}  «  E{Dk|P  <  min(M,6d  *2v  ) }  • 


Let  p^Pq,  p2  «  min(M,ed"2Y  )  and  define  rj  and  r2  by  (21)  and  yj  and 


y2  by  ( 22)  .  Then, 


E{Dk|P>P0) 


V2ff2  .k  -2  -r2/2<r2 

3  J  (p-p  )  r«r  e  ' 


k  .  eJ(-Dk  Pq 

*  £  (?)  “ — TT, - =7T  ^r<1-  yJ  »y,>  -r(i-  yJ  ,  y,))  • 

.  ...  j*0>  (ZaV]  e  1  2 

In  particular,  if  we  denote  E{d|p<p2)  by  Vj  and  E{D2|p<p2)  by  v2  ,  then 


we  have 


Pi  jlj 

Pt{T)  ®  Pr{£  D,  <  D  |  max  P.  <  M }  Pr{  max  P  <  M } 
i=l  1  1  <  i<  K  1  l<i<K  1 

[  D  “Kvl  ^  -fly^d2/***) 

Pr{T)  -  o \-===r je 


iK(v2-(v1)“) 


At  this  point,  we  digress  briefly  to  note  certain  aspects  of  the  assumptions 
which  have  been  made. 

In  Sections  2  and  3,  we  have  assumed  that  the  number  of  explosions  was  a 
fixed  quantity.  However,  it  may  appear  more  reasonable  to  suppose  it  to  be  a 
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random  variable..  We  can  see  this  as  follows.  If  N  missiles  are  fired  at  the 
target  then  some  may  not  explode  by  virtue  of  defects  and  some  may  be  Intercepted 
by  defenses.  Hence,  for  any  given  target,  it  may  be  reasonable  to  assume  that 
the  number  of  missiles  which  explode  is  a  random  variable  whose  probability  dia- 
trlbution  depends  on  the  number  of  missiles  fired  at  the  target,  the  reliability  of 
the  missile  system,  and  the  nature  and  extent  of  the  defenses  of  the  target.  In 
Section  7,  we  provide  a  brief  analysis  of  this  problem.  The  results  of  Sections 
2  and  3  will  nevertheless  provide  reasonable  approximations  to  this  more  compli¬ 
cated  model  in  a  large  variety  of  situations.  In  order  to  use  these  results  in  this 
manner,  "K"  in  Sections  2  and  3  should  be  Interpreted  as  the  expeated  value  of 
the  random  variable.  This  is  accomplished  by  permitting  K  in  (  30)  to  assume 
arbitrary  real  positive  values,  despite  the  fact  that  In  the  derivation  of  ( 30) ,  K 
has  been  presumed  to  be  an  integer. 

Then  we  note  that  the  assumption  of  tho  oiroular  normal  distribution,  i.  a. 

<ii  ■  a  2,  which  has  bean  employed  throughout  Sections  2  end  3,  may  not  be 
completely  justified.  The  usual  nature  of  ballistics  problems  would  suggest  that 
the  two  parameter  family  of  probability  density  functions  given  by  (1)  should  be 
more  appropriate,  since  there  seems  to  be  no  reason  to  assume  that  the  two  error 
components,  distanoe  and  lateral  errors,  should  have  the  same  variance.  This 
assumption  is  relaxed  in  Section  6,  in  which  we  give  a  brief  disoussion  of  some 
suggestions  tor  treating  the  more  general  problem. 

In  addition,  it  may  be  noted  that  the  model'll  quite  sensitive  to  the  choice  of 

the  coefficients  and  exponents  in  (  4)  j  such  as  \o®EP,  which  we  have  denoted  by 
S,  for  instance. 


In  Section  5,  an  example  is  provided,  which  shows  that  two  moderately 

different  choices  of  fi  can  produce  drastically  different  results. 

We  also  note  that  M  remains  constant  during  the  entire  bombardment.  It 

m 

would  seem  more  reasonable  to  assume  that  if  ][]  D.  >  0,  m  <  K,  then  the 

i=l  1 

vulnerability  to  a  single  shock  should  be  reduced  for  later  shocks,  since  the 
target  has  already  suffered  some  damage. 

Moreover,  the  basic  formula  (  4)  ,  used  in  computing  the  free  field  stress, 
appears  to  have  certain  defects.  We  point  out  in  particular  one  defect. 

If  the  target  is  located  on  the  surface  ( i.  e.  d  =  0) ,  then  the  free  field 
stress  for  a  direct  hit  is  infinite,  regardless  of  the  magnitude  of  E  .  There  are 
many  other  plausible  choices  which  might  be  used  in  place  of  (  4)  and  would 
still  approximate  (4)  for  d>  0  without  the  defect  at  d  -  o  .  However,  we  have 
proceeded  under  the  assumption  that  (  4)  will  give  satisfactory  results  for  those 
values  of  R,  d,  c,  «,  (3,  y  and  E  which  are  in  regions  of  interest  to  potential 
users  of  the  results  cited  in  this  paper. 

Whatever  assumption  we  use  in  place  of  {  4)  ,  there  is  still  the  followmg 
concern.  Since  (  4)  ,  or  its  replacement,  will  be  obtained  from  empirical  data, 
we  must  assume  that  it  is  only  approximately  valid,  but  not  exactly  valid.  Then 
let 

p  =  Pr  {P  <  M} 

be  the  exact,  but  unknown  probability.  The  answer  given  by  (  4)  ,  may  be  de¬ 
noted  by  p  +  6  .  Then,  for  K  explosions, 
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Pr{P1  <  M,  P2  <  M,  ....  PK<  M}  s  pK 
£ 

which  wo  estimate  by  (  p  +  6)  .  If,  we  compare  these  two  quantities,  we  havo 

that  approximately 

l  *  mK  _  K  CK/ p 
(  p  +  6)  ~  p  e  r 

for  6  small  compared  to  p  .  Thus,  for  K  large,  very  substantial  errors  may 
be  produced.  We  exhibit  the  magnitude  for  one  simple  example.  Let  p  *  .  93 
6  «  .  04  and  K  s  12  ,  Then, 

PK  =  .4186  (p  +  6)K  =  .8704 

and 

[<p+  6)  /p]  K  =  2.  079  . 

Hence,  even  if  (  4)  is  nearly  correct,  so  that  p  is  approximated  fairly  well  by 
use  of  ( 12) ,  raising  to  a  large  power  will  introduce  very  big  errors. 

5.  Some  Illustrations 

\Ve  now  show  how  Sections  2  and  3  may  be  employed  to  analyze  the  problem  for 
several  ranges  of  parameter  values,  using  a  variety  of  rough  approximation  methods. 

Example  1.  For  a  certain  subset  of  parameter  values,  q,  =  Pr  {  max  P  >M) 

l<i<K  1 

may  be  close  to  unity.  If  this  Is  the  case,  it  Is  immediately  apparent  that  since 

r 

Pr{T}<l  -  q^,  Pr{T)  Is  close  to  zero. 

From  ( 12)  ,  we  have,  for  M  <  ed-2^ 

(31)  qM  ’  Pr{  max  P.  >  M}  =  (l-e"*(e/M)  }/2ir2)K 

1  < i <K  “  V 

Thus,  for  0  <  f  <  1,  >  1  -  «  Is  equivalent  t.o 
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j  _(]_{//K  e-{(9/M)l/Y-  d2>/2cr2 
Hence,  q^  >  1  -  «,  whenever, 

(3  2)  d2  -  2<r2  Iog(l-(1-Ol/K)<(|1)l/V 

Therefore,  whenever  (  32)  holds  for  sufficiently  small  «  >  0,  it  is  apparent  that 
the  probability  of  survival  is  negligible.  In  general,  evaluating  q^  (  31) 
provides  an  easily  computable  upper  bound  for  Pr{T}  . 

-  2  v  # 

Example  2.  We  now  assume  that  M  >  0d  1  =  p  ,  so  that  M  plays  no  role  in 

the  computation  of  the  survival  probability.  If  in  addition,  pQ  exceeds  nip  then, 

>!< 

since  g(  p)  is  monotone  decreasing  for  pQ  <  p  <  p  ,  we  may  be  able  to  replace 
g(  p)  by  a  simpler  function,  such  as  a  linear  function  or  an  exponential  function 
in  that  region.  We  will  now  briefly  discuss  the  approximations  obtained  in  this 
manner. 

From  ( 12)  , 

-{( ©/pn)l/Y  -  d2}/2*2 

(33)  Pr{P<pQ}=e  =  G(  pQ)  . 

Thus,  out  of  K  explosions,  on  the  average,  ( I  -  G{  p  ) )  K  will  have  a  free 
field  stress  exceeding  p 

0 

Let 

(34)  p  =  ^p0  +  p*)  . 

Then,  if  we  expand  g(  p)  in  a  Taylor  series  about  p,  we  obtain 

(35)  g(  P)  =  g(  P)  +  (  p-  p)  g'(  p)  +  R(  p)  ,  PqIP^P*  , 
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where 


R(P)  =i£f^Lg"(l) 

for  some  p  between  p  and  p  . 

If  R(  p)  is  sufficiently  small,  we  can  replace  g(  p)  by 

(36)  3(p)  =  g(  0)  +  (  p -?>)  g'(  p) ,  P0£P5P*  • 

Defining  h(  p)  by 

(  37)  h(  p)  =  §(  p)/(  1  -  G{  pQ) )  ,  P0  <  P  <  P*  , 

we  see  that  h(  p)  Is  approximately  a  probability  density  function  and  the  conditional 

moments  of  P  are  approximately  the  uncc  ^itional  moments  of  h(  p)  ,  That  is, 

P* 

(38)  E(Pk  |p  <  P  <  p*>  ~  /  pkh(p)dp  . 

P0 

To  evaluate  the  integral  in  (  38) ,  it  is  convenient  to  write 
h(  p)  =  h,(  p)  +  h2(  p)  , 

where,  since  h(  p)  is  trapezoidal  by  (  36)  and  (  37) ,  we  can  write 
bj(  p)  =  h(  p*) ,  PQ  <  P  <  P* 

and 

h2(  p)  =  h(  p)  -  h(  p*) ,  PQ  <  P  <  P*  , 

ii< 

is  a  linear  function  with  h2(  p  )  =  0  . 

The  following  elementary  results  of  probability  theory  can  now  be  employed; 

1)  The  moments  of  the  rectangular  distribution  on  (0,b)  are  given  by: 

,  -1  fk  k  .  bk 
pk  =b  f  x  dx  — 
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-2 

2)  The  moments  of  the  triangular  distribution  on  (  0,  b)  defined  by  f(  x)  =  2b  (  b-x) , 

0  <  x  <  b  are  aiven  by 

b  .  ,  , 

v  =  /  2xK  b  {  b-x)  dx  =  2b  /(  k+2)  ( k+1)  . 
k  0  ' 

^  •  i 

It  is  convenient  now  to  identify  with  0  and  p  -pQ  with  b  .  That  is,  we  i 

define  j 

fi(  p  -  p0)  =  h(  p)  PQ  <  P  <  P  ‘ 

and 

^(p-Pq)  =hi(p)  i  =  1,  2;  pQ  <  p  <  p*  . 

Then,  it  is  clear  that  there  is  a  constant  £,  0  <  t,  <  1,  namely  j 

>:<  .* 

i  ■  /  h.(  P)  dp  =  h(  p*)  (  pS,‘  -  p  )  \ 

P0 

* 

such  that  C-1hj(q)  is  the  rectangular  distribution  on  (0,p’'  -  and 
( 1  -  i)  h2(  q)  is  approximately  the  triangular  distribution  on  (  0,  p‘  -  p  )  . 

Hence,  the  moments  of  q)  are  approximately  given  by 

c(P*-P0)k  (l-;)  2(  p’!<-p0)k  (p’!<-p0)k  6k+2 

\  =  k+l  +  ( k+1)  ( k+2)  “  (k+1)  *  k+2  *  1 

i 
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Thnq  for  If  =  1  WB 

*  ~t  "  “ 

e{p!pq  <p<p*}  -  p0  + 

We  can  interpret  the  above  calculations  as  follows.  About  ( 1 -G(  PQ))K  explosions 
will  exceed  pQ  ,  and  of  these,  the  average  force  will  be  about  PQ  +  “j  •  Con¬ 
sequently  the  average  degradation  per  explosion  exceeding  pQ  will  be  about  a  , 
Hence,  the  probability  of  survival  Pr(T)  will  approximately  satisfy 


Pr{T}  >  .  5 


if 


D  >  tfjd  -G(P0))K 
and  less  than  .  5  otherwise. 

A  more  refined  estimate  of  the  probability  of  survival  can  be  obtained  by 

* 

computing  the  variance  of  P  conditioned  on  pQ  <  P  <  p  .  Since  the  variance  is 

2  2 

translation  invariant,  i.  e,  o-  *  cr  for  all  real  numbers  a,  the  variance  is 

x  x-a 

given  by  «2  -  a^,  and  hence 

2  .1 P_3—  {2L±J.).ff2  . 

p|pQ<P<p*  3  '  4  '  1 

We  can  now  apply  the  central  limit  theorem,  obtaining 


Pr{T}~* 


D  ..«1(l-Q(p0))K 


\*i<l-Q(  P0 »Ko""p"|  Pq<  p  <  p*  I 

An  alternative  procedure  which  leads  to  estimating  g(  p)  by  an  exponential 
function  can  be  constructed  as  follows, 

Expand  ""  fliven  ln  06)  in  a  Taylor  series  about  $  obtaining, 
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I 


li°a.ai£l  ,  T  t  p  -  0,  . 

dP 

Solving  the  indicated  differential  equation,  we  have 
9<p)  '  e“+TP  +  (»/2)(p-?)2  , 

If  we  can  assume  that  the  second  degree  term  in  the  exponent  can  be  ignored, 
then  g(  p)  has  an  exponential  approximation.  The  conditional  moments  can  now 
be  readily  computed  and  the  central  limit  theorem  can  be  applied  precisely  in  the 
same  manner  as  above.  The  specific  details  are  omitted. 

Example  3,  This  example  is  introduced  to  give  some  Indication  of  the  sensitivity 

of  the  probability  of  survival  to  changes  in  the  parameters  f)  and  . 

>!< 

It  is  apparent  from  (  31)  that  we  can  choose  6  so  that  M  <  p  and 

>  1-t  for  any  «  >  0,  so  that  the  probability  of  survival  will  not  exceed  <•  . 

>!< 

Now  reduce  6  so  that  M  =  p  and  hence  M  plays  no  role  in  the  analysis. 

>!< 

Then,  the  damage  per  explosion  exceeding  pQ  is  bounded  by  p  -  pQ  and  the 

proportion  of  explosions  that  exceed  pQ  is  given  by  1  -G(  pQ) .  Thus,  the  average 

>!< 

total  cumulative  degradation  can  not  exceed  K(p  -  pQ)(l  - G  (pQ))  which  for  suitable 

)jc 

choice  of  pQ  ,  can  be  made  less  than  D  ;  and  hence  Pr{T}  can  be  made 
arbitrarily  close  to  unity. 
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0. 


Estimating  the 'Probability  o£  Survival  when 


<r,  <r 


2 

*» 


From  (2),  we  note  that  the  marginal  distribution  of  r,  f^(r)  cannot  be 
obtained  in  closed  form  in  this  oaso,  and  consequently,  the  marginal  distribution 
of  ?  can  not  be  written  ;n  closed  form  either.  Hence,  in  this  case,  we  must 
resort  to  numerical  integration.  This  section  will  therefore  be  devoted  to  a 
brief  discussion  of  our  ideas  in  this  direction,  and  to  the  manner  in  which  they 
may  be  exploited  to  obtain  estimates  of  the  probability  of  survival. 

2  2  2 

Consider  the  integrand  on  the  right  of  (2),  and  note  that  for  or^  emax^,®^), 
we  have 


2  2. 
r  .cos  0 

2  '  2 

'1 


+ 


Sin  6 


>  > 


2  2  2 
r  ,  cos  6+  sin  8 

2  '  2 
c r« 


2  2 

Thus  for  r  /a-  Q  sufficiently  large,  the  Integrand  on  the  right  of  ( 2)  does  not 
provide  any  appreciable  contribution  to  f^r),  end  for  purposes  of  Integration, 
we  can  replace  f^(  r)  by  ssero. 

in  brief,  for  any  bounded  function  g(r),  there  is  a  real  number  S,  such 
that  we  can  replace 

oo  s 

/  g(r) f,< r) dr  by  /  g(r)f.(r)dr  . 

0  1  0  1 

We  will  therefore  evaluate  f^r)  numerically  for  a  sufficiently  dense  set 
of  r  values,  0  <  r  <  S,  so  that  integrations  of  the  type  denoted  above  can  be 
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evaluated  by  numerical  methods  (  Simpson's  rule,  for  example)  ,  with  sufficient 
accuracy  for  our  purposes. 

Since  the  integrand  in  (  2)  depends  on  0  only  through  sin2®  and  cos2® 
for  each  fixed  r  we  can  choose  a  uniformly  spaced  and  sufficiently  dense  set 
of  9  values  in  0<®<  ir/2  to  evaluate  f^(r)  numerically. 

Then,  using  (11),  we  compute 


Pr{P<  P0}  =  /  f^( r) dr 


Pr{P  <  M  }  =  /  fj(  r)  dr 


where 


[(®/M)1/v-d2]1/Z  , 


M  <  ®d 


M  >  ®d 


[(e/Pn)i/v-d2]1/2 


0  <  pQ  <  min(  M,  0d 
otherwise  . 


We  now  proceed,  much  in  the  same  manner  as  in  section  3,  by  applying 
the  central  limit  theorem  to  the  random  variables  D^,  D 2, . , , ,  ,  and  therefore 

obtaining  the  analogue  of  ( 30} .  In  order  to  do  so,  it  is  necessary  to  compute 
the  conditional  first  two  momenta  of  the  degradations  D^,  given  that  P  <  M  . 
Hence,  we  readily  have  that 


Hence 


E{D*|P  <  M}  =  E{Dk|R  > 


'^/2(P-P0)kfl{r)drJ^/£l(r)drJ  1  . 


43)  E{Dk|p  <  M}  f  (e(r2+d2)"  Y  -  p 


Q]kfI(r)dr|^/fl(r)  dr^ 


jj. 

Designating  E{D|P<M}  tty  and  E{D  |P  <  M}  by  P2,  we  can  now  write 
the  analog  of  (  30) ,  i.  e. , 

f  ^-Kp  >  K 

(44)  Pr{T)  “  #  -7— ™  (Pr{P  <  M}  )N  . 

WK((32- <  Px) 

Two  suggestions  for  applying  the  methods  of  sections  2  and  3  have  been 
considered.  In  one  of  these,  we  compute 

00 

E(R)  ■  /  rUr)  dr 
0 

and  equate  this  to  ~  *Tzn  <r  .  Then,  the  solution  for  v  , 

(45)  <r  »2E{R)/n'2 tT  -  0.  85776  E(R) 

can  be  used  in  ( 3)  to  obtain  an  approximation  to  f^(r  )  which  avoids  the 
complications  of  this  seotion. 
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Alternatively,  one  may  consider  finding  R  such  that 

p* 

/  f,(r|dr.i 
0 

III 

and  equate  R  to  the  median  of  f^r)  in  (3),  i.  e.  set 
(46)  R*2  ■  2(10g2)<r2 

and  use  the  value  of  <r  2  thus  obtained  in  (3).  Some  numerical  comparisons 
have  been  made  between  the  results  of  (44)  and  those  obtained  by  using  (45) 
and  (46)  to  simplify  the  problem. 

It  was  noted  that  the  discrepancies  are  substantial,  suggesting  that  the 
two  proposed  approximations  are  not  very  good.  A  careful  examination  of  the 
discrepancies  shows  that  the  estimation  of  Pr(  P  <  M)  is  fairly  good  for  a 
single  explosion,  but  the  exponentiation  for  K  explosions  produces  large  errors 
this  phenomenon  was  previously  noted  in  section  4. 


n  _ i _  if _ i ..  1.1  . 

tntuuuju  vouauiv 

In  the  preceding  sections,  it  was  tacitly  assumed  that  K,  the  number 
of  missiles  that  ponatiate  the  defenses  and  explode  in  the  neighborhood  of  the 
target  is  a  fixed  quantity.  Tho  purpose  of  this  section  is  to  give  some  idee  of 
the  extent  to  which  the  probability  of  survival  of  the  target  is  affected  by  allowing 
K  to  be  a  random  variable  rathor  than  a  fixod  quantity.  To  this  extent  a  number 
of  computations  have  been  performed  in  which  K  is  a  random  variable  with  a 
probability  distribution  p^(k),  where 

Pr{k  missiles  explode  |  N  sent}  =  Pr{K“k}  *pN(k)  . 

We  consider  two  possible  models  which  lead  to  the  following  different  choices 
©f  P^( k)  • 

(i)  the  binomial  distribution 
\  ii)  a  mixture  of  two  binomial  distributions 

(i)  Assume  tho  probability  thot  each  missile  explodes  remains  the  same 
for  ail  missiles,  and  that  whether  a  given  missile  explodes  or  not  is  independent 
of  tho  performance  of  any  othor  missile  thot  is  sent.  We  denote  the  constant 
probability  that  a  missile  explodes  by  r  and  henoe  We  have 

(47)  P{K-k)  «  pN(k)  .  (  J)  rk  <1  -  r>  N“R  . 

Then,  the  probability  of  survival  is  given  by 

(4fi)  Pr  (T>  f!  pM(k)  Pr  (Survival  |K- k} 

k-0  N 
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Some  numerical  computations  have  been  made,  ior  N  =  17,  r  =  .  7.  cind  tn  whic  h 
(  30)  ha'  been  used  to  estimate  Pr^Survival  I  K  =  ( 17 )< .  7)  =  11.  9},  which  is  the 
expected  value  of  K.  These  comparisons  were  made  ior  fc  selected  choices  of 
0  and  two  choices  of  D  ,  leaving  o,  M ,  and  pQ  fixed  throughout  ( 12  compari¬ 
sons  in  all).  Over  the  set  of  comparisons,  it  was  noted  that  the  maximum 
difference  between  the  probability  of  survival  computed  using  (  48)  and  the 
probability  of  survival  computed  using  {  30)  was  .  015,  suggesting  that  the 
approximation  using  (  30)  may  be  quite  good  for  a  fairly  large  range  of 
parameter  values. 

* 

It  has  also  been  noted  that  the  approximation  tends  to  improve  as  D 
increases.  This  is  fairly  natural,  since  the  central  limit  approximation  employed 
in  (  30)  will  tend  to  become  more  accurate  as  D  increases. 

(ii)  It  is  natural  also  to  envision  circumstances  in  which  the  probability 
of  a  missile  exploding  may  change  as  the  circumstances  governing  the  defense 
of  the  target  change.  Suppose  that  if  the  defenders  have  been  warned  ( for 
instance,  with  respect  to  the  direction  of  approach  of  the  missiles  by  the  DEW 
line)  then  the  defenses  can  eliminate  about  15  out  of  a  fliyht  of  17  missiles  on 
the  average,  but  if  they  are  not  warned  they  can  eliminate  only  about  2  of  them 
on  the  average.  Suppose  further  that  the  chance  of  getting  such  a  warning  is 
approximately  25% 

In  general,  from  this  point  of  view,  we  will  get  a  mixture  of  two  binomial 
distributions,  1.  e. 

(49)  PN(k)  “U^rfd-r/^  +  d-tH^r^l-r^^ 

where  0  <  4  <1.  For  the  above  set  of  circumstances,  we  would  have  4  =  .  25, 


H 
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and  for  N  =  17,  as  in  part  ( 1)  we  could  take  r^  ~  .-12  and  r^  ~  .  88.  (  48) 
applies  with  p  (  k)  as  given  in  (49  )  • 

N 

To  approximate  in  this  case  using  (  30),  we  can  evaluate  (  30)  numerically 
for  K  *  Nrj  and  for  K  *  Nr2,  and  then  average  these  two  results  with  weights 
4  and  1-4  respectively.  The  nifmerica I  comparisons  which  have  been  made 
suggest  that  this  recommendation  should  have  wide  applicability. 


SIMON  AWARDED  1966  WILKS  MEMORIAL  MEDAL 


Major  General  Leslie  E.  Simon  (Ret. )  received  the  1966  (second) 
Samuel  3.  Wilks  memorial  Medal  during  the  Twelfth  Annual  Conference 
on  Design  of  Experiments  in  Army  Research,  Development  and  Testing, 
which  was  held  at  the  National  Bureau  of  Standards,  Gaithersburg, 
Maryland,  19-21  October  1966.  General  Simon  has  long  been  recognized 
both  on  a  national  and  international  basis  for  his  outstanding  contributions 
to  Army  statistics,  reliability,  quality  control  and  promotion  of  statis¬ 
tical  activities  generally.  General  Simon  was  a  long-standing  friend  of 
Sam  Wilks  and  conferred  with  Sam  on  many  statistical  problems  and 
activities. 

The  Wilks  Award  is  given  each  year  to  a  statistician  and  is  based 
primarily  on  his  contributions,  either  recent  or  past,  to  the  advance¬ 
ment  of  scientific  or  technical  knowledge  in  Army  statistics,  ingenious 
application  of  such  knowledge,  or  successful  activity  in  the  fostering  of 
cooperative  scientific  matters  which  coincidentally  benefit  the  Army, 
the  DOD,  and  the  Government,  as  did  Samuel  S.  Wilks  himself. 

Dr.  Frank  E.  Grubbs  received  the  initial  Wilks  Medal  in  November 
1964,  and  Dr.  John  W.  Tukey  of  Princeton  University  received  the  first 
Wilks  Memorial  Medal  in  October  1965  at  the  Eleventh  Design  of  Experi¬ 
ments  Conference. 

, 

The  Award  consists  of  a  medal,  with  a  profile  of  Professor  Wilks 
and  the  name  of  the  Award  on  one  side,  and  the  seal  of  the  American 
Statistical  Association  and  the  name  of  the  recipient  on  the  other  side; 
an  honorarium  related  to  the  magnitude  of  the  award  funds  donated  by 
Mr.  Rust;  and  a  citation. 

With  the  approval  of  President  Frederick  F.  Stephan  of  the  American 
Statistical  Association,  the  Wilks  Award  Committee. for  1966  consisted  of: 

Professor  Robert  E.  Bechhofer,  Cornell  University 

Dr.  Francis  G.  Dressel,  Duke  University  and  the  Army 
Research  Office -Durham 

Dr.  Churchill  Eisenhart,  National  Bureau  of  Standards 

Professor  Oscar  Kempthorne,  Iowa  State  University 

Dr.  Alexander  M.  Mood,  U.  S.  Office  of  Education 

Dr.  Frank  E.  Grubbs,  Ballistic  Research  Laboratories, 

Aberdeen  Proving  Ground,  Maryland  --  Chairman 


The  citation  lor  Uic  second  WiiKs  medalist  reads  as  follows: 


"To  Major  General  Leslie  E.  Simon  for  hie  pioneering 
contributions  to  Quality  Control,  Sampling  Inspection, 

Reliability  and  Army  Design  of  Experiments,  and  for 
his  timely  promotion  of  statistical  activities  which  have 
benefited  not  only  the  Army  but  our  government  i*nd 
country  as  well.  " 

General  Simon  received  the  second  Wilks  Memorial  Medal  at  the 
banquet  of  the  Twelfth  Design  of  Experiments  Conference,  the  presenta¬ 
tion  being  made  by  President  Frederick  F.  Stephan.  General  Simon 
replied  as  follows; 

"President  Stephan,  Chairman  Grubbs,  ladies  and  gentlemen: 

I  am  most  grateful  for  the  honor  that  our  Association  has  seen  fit  to 
bestow  upon  me.  However,  1  am  primarily  a  professional  soldier; 
and  secondarily  a  statistician.  Thus,  1  had  difficulty  in  rationalizing 
the  bases  on  which  my  colleagues  came  to  the  conclusion  that  one  of 
my  statistical  attainments  should  be  so  honored. 

"While  considering  this  matter,  I  happened  to  read  a  letter  from 
Alfred  S.  Romer,  President  of  the  American  Association  for  the 
Advancement  of  Science,  that  was  published  in  the  September  issue  of 
the  Bulletin  of  that  association.  Two  paragraphs  of  that  letter,  I  believe, 
not  only  explain  the  place  of  the  AAAS  in  the  whole  regime  of  the 
scientific  community,  but  by  analogy  apply  equally  to  the  very  important 
role  of  the  American  Statistical  Society  in  the  large  and  diverse  field  of 
statistics.  Additionally,  these  paragraphs  may  be  of  some  application 
to  individuals.  I  would  like  to  read  to  you  these  two  paragraphs. 

'When  the  Association  was  founded,  well  over  a  century 
ago,  all  American  scientists  could  --  and  did  --  meet  in  one 
■mall  hall;  in  those  days  specialization  had  not  advanced  far 
in  any  field,  so  that  an  astronomer,  a  chemist,  a  botanist 
could  all  talk  more  or  less  understandably  to  one  another. 

But  the  number  of  American  scientists  grew  constantly  and 
specialization  increased,  creating  a  babel  of  often  mutually 
unintelligible  scientific  tongues.  In  consequence,  a  centrifugal 
process  set  in:  special  societies  in  various  fields  were 
established;  and  with  the  continual  increase  in  number  of 
scientists,  it  long  ago  became  impossible  for  any  city  in  the 
country  to  accommodate  at  one  time  all  the  members  of  all 
scientific  groups. 
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•Although  many  major  societies  low  meet  separately,  the 
annual  meetings  of  the  AAAS  still  include  technical  sessions 
in  nearly  every  area.  Most  important  is  the  fact  that  the 
Association  i_s  the  one  organization  appropriate  for  symposia 
and  conferences  in  interdisciplinary  areas.  Still  further, 
there  are  many  subjects  of  common  interest  to  scientists  of 
every  sort  (government  relations  to  science,  for  example), 
and  the  AAAS  is  the  appropriate  forum  for  discussion  of  such 
problpms.  1 

"The  centrifugal  process  described  by  Dr.  Romer  surely  took  place 
in  the  science  of  statistics  quite  as  much  as  in  any  field  of  science. 
Furthermore,  the  American  Statistical  Association  is  the  one  organiza¬ 
tion  that  binds  together  the  common  interests  of  all  the  specialized 
statistical  organizations.  About  twenty  years  ago,  I  had  the  honor  of 
being  a  member  of  an  ad  hoc  committee  appointed  to  consider  the  future 
of  ASA  and  it  recommended  that  rendering  this  service  should  be  a  goal 
of  ASA. 

"The  implication  of  Romer 's  remarks  to  individuals  is  one  additional 
step  in  logic.  The  number  of  statisticians  has  increased  enormously, 
during  the  last  quarter  century,  along  with  concomitant  gains  in  powerful 
statistical  tools  and  increased  recognition  of  the  importance  of  Statistics. 
One  who  enters  a  field  while  it  is  in  a  rapidly  expanding  stage  naturally 
has  more  opportunities  for  identifiable  achievement  than  one  who  enters 
after  it  has  become  mature  and  more  densely  populated.  In  a  mature' 
activity,  one  exchanges  some  of  the  challenges  of  pioneering  for  the 
important,  but  less  conspicuous  satisfactions  of  pleasurable  cooperative 
work  with  colleagues,  the  enjoyment  of  more  sophisticated  techniques 
and  pride  in  the  perfection  of  one's  work. 

"Timing  one's  entry  into  a  field  is  only  slightly  more  practicable 
than  making  a  judicious  selection  of  one's  ancestors,  as  sometimes 
recommended  by  the  medical  profession.  I  made  np  choice.  However, 
due  to  the  need  for  better  methods  for  solving  Army  problems,  I  happened 
to  begin  work  in  Statistics  relatively  early  and  under  favorable  circum¬ 
stances  for  ready  identification;  and  I  cannot  escape  the  belief  that  the 
perspective  of  my  work  is  enhanced  by  a  rather  chance  sequence  of  events 
similar  to  that  described  by  Dr.  Romer.  Thus,  I  am  doubly  grateful; 
first,  for  the  pleasure  and  satisfaction  of  working  in  a  most  engaging  and 
rewarding  field,  and  second,  for  the  generous  recognition  awarded  me  by 
my  colleagues.  " 
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SINGLE  DEGREE  OF  FREEDOM  ORTHOGONAL  COMPONENTS 

OF  A  FACTOR  AT  2K  LEVELS  IN  TERMS  OF  LINEAR 

rp t ttt*  iK  r  AMdin  a  p<tc 

iwntj  xiiiui  &.  uv'ii  *  a  m  vi 

K  FACTORS  AT  2  LEVELS 


Joseph  Weinstein 

Electronic  Components  Laboratory 
U.  S.  Army  Electronics  Command 
Fort  Monmouth,  New  Jersey 


INT RODUC T ION .  F.  Yates  (1937)  algorithm  for  resolving  a  set  of 
2^  observations  from  a  factorial  experiment  on  K  2-level  factors  has 
many  desirable  properties  for  the  data  analyst.  Briefly,  it  is  easily 
learned,  readily  programmed  on  a  computing  machine,  requires  only 
the  simplest  arithmetic  operations  which  limit  the  possibility  of  "blunders", 
and  it  gives  estimates  of  the  effects  of  all  K  experimental  factors  singly 
and  their  joint  effects  2  or  more  at  a  time. 


To  illustrate  the  algorithm  for  those  unfamiliar  with  it  we  consider 
the  set  of  numbers 

i  =  1.  2,  3 . 2K 


(1) 


j  =  0,  1,  2 . K 


where  X^  represents  the  input  data  obtained  from  the  factorial  experiment 
and  is  structured  in  the  standard  order. 


Then  one  computes 


,(K-1) 


(2) 


X.  ... 
i,  J+l 


X,,.  .  .  +  X,.  .  :  1=1,2, ...  2' 

(2i- 1) ,  j  2i,j 

-X  K  +x  *  .  i-2(K-D+1  2(K-1)  2 

X(2i-1-2K), j  +  X(2i-2K),j  •  i_Z  +1,  2  +2, 


.  .  ,2 


K 


iterating  until  j  *  K-l  so  that  the  contrast  vector  X^  has  been  computed. 

For  K  =  2  and  the  treatment  factors  A  and  B  the  input  data,  Xio  can  be 

represented  by  the  treatment  combinations:  (1),  a,  b,  ab;  where  the  non- 
appearance  of  the  small  letter  (a)  implies  that  one  of  the  levels  of  that 
factor  (A)  was  included  in  the  conditions  giving  rise  to  that  observation  and 


the  appearance  of  the  small  letter  (a)  implies  that  the  other  level  of  the 
factor  (A)  was  present.  Then  application  of  the  algorithm  gives 


X. 

io 

^12 

(1) 

(l)  +  a 

(l)+a+b  +  ab 

a 

b  +  ab 

-(1)  +  a  -  b  +  ab 

b 

-(1)  +  a 

-{!)  *  a  +  b  +  ab 

ab 

-b  +  ab 

(1)  -  a  -  fc  +  ab 

The  entries  of  the  last  column  will  be  observed  to  be  respectively  the 


(3) 


sum  of  the  input  observations 
the  contrast  of  the  A  factor 
the  contrast  of  the  B  factor 
the  interaction  contrast  of  A  and  B 


Where  an  experimental  design  involves  one  or  more  factors  which  are 
varied  over  a  number  of  levels  which  is  a  power  of  2  we  shall  show  that 
the  computational  advantages  of  Yates  Algorithm  can  still  be  retained  in 
the  data  analysis  b/  relating  linear  combinations  of  the  results  obtained 
to  the  desired  factor  effects.  For  a  factor  at  2^  levels  these  desired 
factor  effects  are  of  course 


(4) 


the  sum  of  the  observations 
Linear  effect  of  the  factor 
Quadratic  "  "  "  " 

Cubic  "  "  "  " 


Thus  for  a  factor  X  which  is  to  be  varied  over  4  levels  K  =  2  and  our 
interest  is  in  the  first  four  rows  above  and  we  shall  examine  how  they 
relate  to  the  four  rows  of  (3)  which  involve  2  factors  A  and  B  each  at 
2  levels. 

Practical  Solutions  and  General  Solutions.  Given  that  we  will  repre¬ 
sent  the  four  levels  of  an  experimental  factor  W  by  the  four  treatment 
combinations  available  from  2  dummy  factors  A  and  B  which  range  over 
2  levels,  the  question  of  assignments  arises  (for  this  can  be  made  in 


4!  ways).  These  24  possible  assignments  are  detailed  in  Figure  I  - 

"O  Matrices"  which  relates  the  levels  of  W  (w, ,  w„,  w,.  w.)  to  the 

12  3  4 

treatment  combinations  of  A  and  B  by  the  appearance  of  a  1  in  the  row 
and  column  and  0  elsewhere. 

For  example 

(1)  a  b  ab 

w1  “  i  o  r 

w^  10  0  0 

w3  001  0 

w^  0  0  0  1 

4 

assigns  w^  to  a,  w^  to  (l),  to  b  and  w^  to  ab. 

It  is  immediately  apparent  that  practical  limitations  will  constrain 
a  catalog  of  relationships  to  the  case  of  a  factor  at  4  levels.  For  the 
next  step  would  involve  K  *  3,  or  a  factor  at  8  levels,  and  8!  (“40,  320) 
possible  ways  of  assigning  the  factor  levels  to  the  eight  treatment  com¬ 
binations  ( 1 ) ,  a,  b,  ab,  c,  ac,  be,  abc.  However,  the  constraint  is 
certainly  not  severe  in  the  case  of  experiments  with  physical  factors  since 
the  investigator  is  rarely  concerned  with  effects  higher  than  cubic. 
Furthermore  the  difficulty  arises  from  the  desire  to  catalog  all  cases, 

But,  if  an  arbitrary  assignment  is  made  between  the  factor  at  2^  levels 
and  the  treatment  combination  of  K  dummy  factors  at  2  levels  each,  then 
the  following  argument  applies  in  general  for  all  values  of  K  for  that 
assignment. 

The  General  Solution.  Given  a  factor  W  at  2  levels  and  K  dummy 
factors  F^,  F^,  F^  each  at  2  levels  which  can  represent  2^  treatment 

combinations  of  a  full  factorial  experiment  we  may  represent  the  assign¬ 
ment  of  treatment  combinations  to  the  levels  of  W  as  a  matrix  equation. 

(5)  W  =  OX 

T 

where  W  is  the  vector  (w^,  W£,  .  .  .  ,  w^k) 

K  K 

O  is  a  2  by  2  matrix  of  1 ' s  and  0's  such  that  only 
one  1  can  appear  in  any  row  or  column 
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X  is  a  vector  of  treatment  combinations  in  standard 
order  for  the  K  2-level  factors 


(1),  fj,  f2,  fjf2.  iy  £^3 . £■ 


k’ 


flf2f3'  '  '  fK 


We  represent  the  computations  of  the  Yates  Algorithm  by  a  matrix 
operator  N  which  is  also  2*  by  2^  and  define  the  results  of  operating  on 
the  input  observations  by  N  as  Y  (the  contrast  vector  for  the  sum  of 

all  the  observations,  the  K  factor  effects  F. ,  F_ ,  ...  F,  and  the 
K  t  2  K 

2^  «K  -1  joint  effects  of  two  or  more  factors).  Thus, 

(6)  Y  =  NX 


since  N  *  Y  =  X  equation  (5)  yields 

(7)  W  =  ON-1  Y 


Consider  that  a  direct  method  of  operating  on  W  by  some  matrix  operator 
M  which  would  yield  the  desired  vector  of  factor  effects  (such  as (4)  ) 
could  be  represented  by  O.  Then 

(8)  n=MW  =  MON_1Y  (From  (7)  )  . 


Such  direct  operators  M  do  exist  and  are  in  fact  the  contrast  coefficient 
vectors  to  be  found  in  tables  of  orthogonal  polynomials  (usually  limited  to 
components  of  5th  degree  or  less). 

Since  M  and  N  exist  and  Ocan  be  cataloged  for  K  =  2  (or  assigned 
arbitrarily  for  K  >  2)  it  is  possible  to  evaluate  MON"^  for  all  values  of  O 
and  thus  define  the  linear  combinations  of  components  of  Y  which  corre¬ 
spond  to  the  desired  components  of  the  real  factor  W. 

The  procedure  is  illustrated  for  K  =  2  and  particular  values  of  O 
selected  from  the  catalog  of  24  possible  values  of  O  given  in  Figure  I. 


For  K  =  2  M  and  N  are  given  by 


(9)  M  = 


/  1  1  1  1  \ 

-3-1  13 

1  -1  -1  1 

^-1  3-3  1  / 


- 1  T 

then  N  can  be  shown  to  be(L/4)N  and  the 

1,  and  7  will  be  used  to  evaluate  MON-1 


special  cases  for  0  labelled 


n 

0 

0 

o" 

"l 

0 

0 

(f 

0  1 

0 

0 

mc^n'1  = 

0 

1 

2 

0 

0 

0 

1 

0 

0 

0 

0 

1 

_0 

0 

0 

1 

0 

2 

-1 

0 

•w 

Thus  0  =  MOjN"*  Y  has  components 


W^, :  sum  of  the  input  date  =  sum  of  the  input  data 

W^:  linear  contrast  for  W  =  dummy  contrast  A  plus  twice 
dummy  contrast  B 

W0:  quadratic  contrast  for  W  =  interaction  contrast  for 
dummy  factors  AB 


:  cubic  contrast  for  W  =  twice  dummy  contrast  A  minus 
dummy  contrast  B  . 


. . G 

and  the  components  of  ft  are  defined  as 

W^, :  sum  of  the  input  data  e  sum  of  the  input  data 

W.  :  linear  contrast  for  W  =  twice  the  dummy  contrast  B  plus  the 

interaction  contrast  of  dummy  factors  AB 
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Wq:  quadratic  contrast  for  W  =  dummy  contrast  A 

VV  ;  cubic  contrast  for  W  =  negative  of  dummy  coni.ra.tti  5  plus  twice  the 

interaction  contrast  for  dummy  factors  AB. 

The  complete  catalog  of  such  relationships  for  the  24  possible 
assignment  matrices  O  is  given  in  Figure  II  "W  components  of  MON"*  Y.  " 

Finally,  the  equivalence  of  results  obtained  by  this  Extended  Yates 
procedure  and  by  conventional  procedures  for  obtaining  single  degree  of 
freedom  contrasts  for  experimental  data  is  detailed  in  Figures  III,  IV  and 
V  for  the  case  K  =  2  and  assignment  matrix  O.  Here  we  consider  a 
factorial  experiment  in  16  runs  where  a  factor  W  is  at  4  levels  and  two 
factors  C  and  D  are  each  at  2  levels,  First  consider  the  conventional 
procedure.  Figure  III  in  column  heading  "real"  lists  the  standard  order 
of  the  real  treatment  combinations  (in  practice  this  would  be  the  column 
of  observations  obtained  from  the  experiment).  The  16  columns  at  the 
right  list  the  coefficients  for  multiplying  the  input  observation  on  the 
same  row  such  that  the  sum  of  products  of  the  input  observation  by  its 
coefficient  estimates  the  factor  contrast  named  at  the  head  of  that  column. 

Similarly,  Figure  IV  also  develops  single  degree  of  freedom  contrasts 
from  an  experiment  on  16  runs  assumed  to  be  a  factorial  experiment  on 
four  2-level  factors  A,  B,  C,  and  D.  Here  the  16  columns  at  the  right 
list  the  coefficients  required  to  obtain  the  contrasts  named  at  the  head 
of  the  respective  columns. 

Extended  Yates  Procedures.  Figure  V  is  the  result  of  combining 
the  columns  of  Figure  IV  according  to  the  rules  of  row  1  of  Figure  II 
(since  O,  was  used  to  assign  W  to  dummy  factors  A  and  B),  It  is  observed 
that  these  16  columns  are  exactly  equivalent  to  those  in  Figure  III. 
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r  ■  '  ■" 

! 

! 

O  -  matrices 

1 

available  for  assignment  of  A  and  B 

1 

treatment  combinations  to  levels  of  W 

i 

* 

i 

2 

% 

4 

5 

6 

10  0  0 

10  0  0 

10  0  0 

10  0  0 

10  0  0 

10  0  0 

* 

0  10  0 

0  10  0 

0  0  0  1 

0  0  0  1 

0  0  10 

0  0  10 

■ 

0  0  10 

0  0  0  1 

0  0  10 

0  10  0 
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CONDITIONAL  EFFECTS  AND  INTERACTIONS 
IN  SYMMETRICAL  FACTORIAL  CONFOUNDING 
WITH  APPLICATION  TO  BIOLOG  r 

N.  R,  Bohidar 

Biomathematics  Division,  Fort  Detrick,  Frederick,  Maryland 
C-E-I-R,  Inc.  ,  Los  Angeles,  California 

INTRODUCTION.  The  conditional  effects  and  interactions  (CE&I's) 
associated  with  a  factorial  experiment  have  the  property  of  establishing 
direct  and  reciprocal  relationship  among  the  various  main  effects  and 
thereby  improving  the  interpretative  information  of  such  effects.  They 
have  also  the  virtue  of  alleviating,  to  some  extent,  the  broad  problems  of 
interpretation  of  higher  order  interactions  such  as  four  or  more  factor 
interaction,  contrasts  o£  the  type  linear  x  quadratic  x  linear  x  cubic  or 
3  2 

AB  C  ,  etc.  by  assigning  appropriate  interpretation  to  their  respective 
conditional  entities.  In  this  treatise  the  concept  of  conditional  effects 
and  interactions  is  introduced  in  consistence  with  the  general  theory  and 
modulo  notation  associated  with  symmetrical  factorial  experiments.  The 
treatment  consists  of  algebraic  definitions,  determination  of  conditional 
effects  and  Interactions  for  a  given  situation  and  orthogonal  partition  of 
sums  of  squares  in  general  anova  procedures.  The  problem  of  estima- 
bility  of  the  CE&cI's  under  classical  confounding  have  been  considered. 
Simple  rules  have  been  developed  for  rapid  examination  of  the  estimability 
of  the  CE&I's  under  confounding  conditions  by  the  application  of  elementary 
operations  of  theory  of  sets.  The  interpretation  of  CE&tl's  is  explained 
by  a  numerical  example  from  a  biological  experiment.  The  topics  such 
as  conditional  confounding  and  its  impact  on  fractional  replication,  frac¬ 
tional  factorial,  asymmetrical  factorials,  etc.  ,  are  not  presented  in  this 
treatise.  The  theorems  and  the  proofs  are  heavily  based  on  the  properties 
of  finite  geometries  derived  from  Galois  fields  and  finite  projective  and 
Euclidean  Geometry  and  combinatorial  theorems.  No  proofs  will  be  given 
here.  Only  the  definitions  and  the  salient  properties  will  be  described. 

The  notation  will  be  consistent  with  the  general  factorial  notation. 

DEFINITION  OF  GEM'S  AND  PROBLEM  OF  ESTIMABILITY.  The 
CE&tl's  are  generated  by  decomposing  the  total  dimension  of  the  factor 
space  into  interpretable  dimensions  and  it  is  expected  that  the  problem  of 
estimability  of  such  effects  and  interactions  becomes  an  immediate  concern. 
The  mathematical  theory  of  factorial  experimental  design  follows  directly 
from  the  theory  of  linear  models  based  on  the  Gauss-Markoff  theorem 
which  states  that  Y,  the  response  vector  with  n  components  if  expressed 
in  terms  of  the  following  linear  model, 


Y  =  X(3  +  e 


where  (J  is  the  column  vector  of  p  unknown  parameters,  X  is  the  design 
matrix  of  dimension  n  x  p  and  e  is  the  error  column  vector  with  n 
components,  then  the  best  linear  unbiased  estimator  of  (3  is, 

$  =  (X'X)‘AX'Y 


obtained  from  the  solution  of  the  following  normal  equations 

X'X$  *  X'Y 

where  X'X  is  non -singular  and  there  is  a  unique  inverse  associated  with 
X'X.  But  the  factorial  experimental  design  matrix  is  not  always  of  full 
rank  and  so  one  is  interested  in  Investigating  the  conditions  of  estimability 
of  a  linear  function  of  the  |3's  such  as  A'P  where  A  is  a  column  vector  with 
p  components.  Now  let  A'P  be  estimated  by  a'y.  One  proceeds  to 
minimise  the  variance  of  a'y  =  a'aff  -  under  the  condition  of  unbiasedness, 
a'X  =  A'  with  p  constrains  by  the  use  of  the  Lagrangian  Multiplier  p.  Now 
solving  the  equations  dQ/da.^,  i  =  1,2. .  .n  and  where  Q  is  the  expression 

to  be  minimized  we  have  a  =  Xp  or  X'Xp  «  A.  This  equation  provides  one 
with  a  condition  of  estimability  which  states  that,  if  there  exists  a  p  such 
that  X'Xp  »  A,  the  coefficient  of  the  linear  function  of  the  p's,  then  A'P  is 
estimable.  In  defining  CE  and  I's,  the  conditions  of  estimability  are 
appropriately  incorporated  into  the  definition,  and  if  one  follows  the 
definitions  and  the  constrains  associated  with  the  definition,  the  problem 
of  estimability  does  not  arise. 

In  consistence  with  the  normal  factorial  notation,  p  denotes  the  level 
of  the  factors  where  p  is  a  prime  number,  n  denotes  the  number  of  the 
factor  and  the  treatment  combinations  are  denoted  byXjX^.  .  .x^,  Xj,  i  =1 , 2.  ,.n, 

being  the  level  of  the  i**1  factor  where  x  takes  the  value  from  0  to  (p-1). 
There  are  pn  treatment  combinations,  there  are  pn-l  degrees  of  freedom, 
there  are  (pn-l)/(p-l)  contrasts  each  with  (p-1)  degrees  of  freedom  and 
each  contrast  with  (p-1)  degrees  of  freedom  is  associated  with  p-sets  of 
pn_l  treatment  combinations.  All  numbers  are  expressed  as  reduced 
modulo  p.  Confounding  for  pn  in  blocks  of  ps  requires  n-o  independent 
effects  or  interactions  to  be  confounded  along  with  all  generalized  effects 
and  interactions  with  a  total  of  (pn ” s -l)/(p -1)  effects  and  interactions  con¬ 
founded.  By  considering  the  modulo  definitions  for  the  (n-s)  independent 
effect  or  interaction  confounded,  one  can  generate  the  pn"s  blocks.  The 
total  degrees  of  freedom  confounded  is  (pn"8-l)  and  the  number  of  effects 
and  interactions  each  with  (p-1)  degrees  of  freedom  have  (pn”s-l)/(p-s) 
effects  and  interactions  confounded.  The  total  number  of  systems  of  con¬ 
founding  for  a  pn  experiment  in  blocks  of  p8  is  equal  to 
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Upn-1)  (pn-p).  •  .(pn-pr  )]  /  t(pn‘g-l)  (p: 


n-s  .  .  n-s  n-s-1,, 

-p).  ..{p  -p  )] 


.  Definition  I.  The  symbolical  representation  of  conditional  effect 

and  interaction  with  one  condition  is 

*  o,  a, 

(1)  As  ...  Na,1/iCj 

1  =  1(  ,  .  n  take  integral  valuer  between  0  and  p-1  and  by 

convention  first  f  0  ■  1,  the  is  any  factor  letters  A,  B,  ...  N  for 

which  its  corresponding  aR  {k  s  1, 2.  . .  n)  =  0  and  j  =  0,1,2,.  .(p-1). 

The  expression  in  (l)  defines  contrasts  among  p-sets  of  treatment 

•  combinations  satisfying  one  of  the  p  following  equations, 

n 

S  a  x  a  0  mod  p  /  x.  =  q 
i*l  1  k 


n 

21  a  x  s  1  mod  P  /  x.  »  q 
i=l  k 


n 

33  ai  xi  =  (p-1)  mod  P  /  x.  ■  q 

i=l  k 


where  a^s  take  integral  values  between  0  and  p-1 ,  k  =  l,2,...n,  q  takes 

integral  values  between  0  and  p- 1 ,  k  can  take  any  value  1 , 2,  ...  n  for 
which  ak  ■  0  and  it  refers  to  the  kth  coordinate  in  the  n-dimensional  space. 

If  p  «  3  and  n  *  4,  then  one  is  dealing  with  3*-case.  AB2/C  is 
estimable  since  ^  =  1,  a2  *  2,  a3  *  0  a4  =  0,  X  =  C  and  j  =  0  wi& 

congruential  equation  [xj  +  2xg  «  0,1,2  mod  i/x^  *  0]  . 

Definition  II.  An  effect  or  Interaction  conditioned  on  more  than  one 
effect  is  a  conditional  effect  or  interaction  with  multiple  conditions 
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(2) 
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In  the  case  of  3  the  conditional  A /«  /r  /d  is  c siimaule 

U  1  H 

since  =  1,  =  0  and  W  /  Y  /  Z.  For  this  case  the 

congruential  equation  is;  x1  =  0,1,2  mod  3/x2  =  0/x3  =  l/x4  =  2. 

Definition  III.  For  p>3,  one  is  interested  in  interpreting  effects 
and  interactions  in  terms  of  their  polynomial  effects  such  as  linear 
effect,  quadratic  effect,  cubic  effect,  etc.  The  linear  effect  is  defined 

as 


and  the  quadratic  effect  is  defined  as 


for  p  equally  spaced  levels  and  represents  the  itil  set  of  the  p  sets 

defining  the  contrast.  For  unequally  spaced  levels  q.,q.  , .  . .  q  . ,  the 
linear  effect  is  defined  as  01  Vl 

P-1  .  P-1  ? 

A  -  £  (q.  -  q)  A.  /  2  1q,  -  q) 

i=0  i= 1  1 

P-1 

and  A"  ■  Z  C  A 
i=l 

where  SC,  =  0  and  ZC.  {qi  -  q)  =  0  . 

The  same  line  of  argument  is  followed  for  the  other  higher  order 
polynomial  effects. 

Definition  IV.  The  expression  in  (1)  and  (2)  will  be  called  condi¬ 
tional  effects  and  interactions  (CE&I)  and  the  unconditioned  effects  and 

al  a2  “n 

interactions  of  the  type  A  B  . . .  N  will  be  called  classical  effects 
and  interactions. 

PROPERTIES  OF  CE  AND  I'S.  The  following  are  the  combinatorial 
and  statistical  properties  of  the  conditional  effects  and  interactions. 
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Property  No.  1*  The  total  number  of  conditional  effects  and  interactions 
and  the  classical  effects  and  interactions  for  a  given  p  and  n  is 

N(n.p)  =  C(2p)n  -  (p+l)n] 


where  n>2  and  p  is  a  positive  integer.  The  exact  number  of  CE  and  I's  are 

N'(n,p)  *  (2p)n  -  (p41)n  -  (pn-l)  . 

A  Table  of  N(n,p)  and  N'(n,p)  have  been  presented  in  the  Appendix. 
Property  No.  2;  Consider  an  effect  or  an  interaction,  denoted  by 
A  *B  .  .  N  n/W./Y^/.  .  */^m  defined  by  the  following  equations 

a,x,  =  0  mod  p/x,  =  a,/.  .  ./x  =  q 

ii  k  r  '  m  m 


a.x.  *  p-1  mod  p/x,  *  q ,/.  .  ./x  =  q 

i  i  ^  k  1'  m  m 


With  other  conditions  satisfying,  each  equation  satisfies 

r  n-l-k, 

[P  ] 

treatment  combinations  where  k  *  number  of  conditions  associated  with 
the  conditional  effects  and  interactions. 

Property  No.  3:  Consider  a  conditional  effect  and  interaction 

fti 

A  B  ...N  /W.,  where  W  is  any  factor  letter  A,  B, ...  N  for  which 

J 

a  s  0  and  j  ■  0,  1,2,.  .  .(p-1).  If  W  is  kept  fixed  and  the  congruential 
equations  associated  with  the  effects  and  interactions  are  solved  for  each 
value  of  j  b  0,  1,2,,  ..(p-1),  then  p  contrasts  are  generated  and  they 
are  mutually  orthogonal  contrasts. 


n 

2 

i=l 


n 

2 

i=l 


Property  No.  4;  Consider  a  conditional  effect  of  interaction  denoted  by 

A  2...N  n/”rj/ *  k/«  .  .  /Zm  with  in  conditions,  where  W,  ¥,  ...Z 

and  j,k, . .  ,m  all  satisfy  conditions  proposed  in  (2).  If  W,  Y, . .  .  Z  are 
kept  fixed  and  the  congruential  equations  for  the  effect  or  interaction  are 
solved  for  each  combination  of  the  values  cf  j,k, .  .  ,m,  then  pm  contrasts 
are  generated.  They  are  all  mutually  orthogonal  contrasts. 

&1  a2  an 

Property  No.  5:  Let  the  symbol  A  B  • . . .  N  denote  the  numerical 
totals  of  the  effect  or  interaction  under  consideration)  then  we  have 


P-1  a,  a  a  a  a 

2  (A  lA  .  ..N  n/W)  =  A  2...N  n  . 
J-0  J 


This  can  be  extended  to  the  case  with  multiple  conditions 


p-1  p-1  P-1  a,  a,  a 

r  2  ...  s  (a  b  .  ..n  n/w /y. /. . ,/Z  ) 

j=0  k=0  m=0  J 

ai  S  a„ 

■  A  B  *...N  . 


Property  No.  6:  For  p>3,  property  5  can  be  extended  to  the  polynomial 
effects,  such  as 

p-1  ana  a  a  a  , 

2  [{A  B  .  . .  N  n)  /W  ]  =  (A  lB  *...N  ) 

j"0  j 


p-1  a,  a.  a  a,  a,  a 

2  [(A  lB  Z...N  T/W.]  =  (A  lB  Z...N  V 

j.*0  J 


So  also  for  the  multiple  condition  case 

p-1  p-1  p-1  a.  a  a 

2  2  ...  2  [(A  lB  Z...N  n)'/W  /Yk/.../Zm] 

j»0  koO  m-0  J 


summed  over  p  terms 


a,  a,  a 
(A  lB  2.  ..N  n)'  . 
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,  ,  -aUa2 

UCb  JM.  JL> 


riuyciiy  t:  ucb  i_>  ,  .  .  n  oc  me  tuieu  oi  tnc  enect  or  inter- 

action  under  consideration,  then  the  sum  of  squares  for  the  conditional 
effects  and  interactions  (SS(CE*(I))  can  be  expressed  explicitly  as  follows 


(3) 


SS(CE8cI)  =  (A^A**2. . .  N°n^/Yk/ .  .  ./ZJ*  (rZ^2) 


-1 


where  r  is  the  number  of  replicates  and  X's  are  the  coefficients  such  that 
22  =  0.  If  X,  Y, . .  .  Z  are  kept  fixed  and  for  each  combination  of  the 

values  assumed  by  j,k, .  .  .m,  a  sum  of  squares  is  calculated  then  this 
pm-set  of  sum  of  squares  forms  an  orthogonal  set  for  the  analysis  of 
variance  tests.  The  conditional  sum  of  squares  can  be  expressed  in  terms 
of  the  combination  of  classical  sum  of  squares  as  follows 


p-1  p-1  p-1  a  a,  a  -  - 

22  22  ...  2  (A  1B  ...N  /X./Y  /.../Z  )/  (rZ2\.) 

j=0  k=0  m=0  k  m 

a.  a.  a  a  a  i  a  a  a 

=  SS(A  B  .  . .  N  n)  +  SS(A  *B  6.  . .  N  °X)  +  . .  .  +  SS(A  AB  . . .  N  nZ) 


where  the  definition  of  X,  Y, ...  Z  and  j,k, . .  .m  are  the  same  as  given 
in  (2).  It  is  also  noted  that  the  expression  in  (3)  generates  single  degrees 
of  freedom  contrast  sum  of  squares. 

CLASSICAL  CONFOUNDING  AND  ESTIM ABILITY  OF  CELI'S.  The 
problem  here  is  the  following: 

ai  a2  an 

Let  a  classical  effect  or  interaction  A  B  ...N  defined  by 
n 

2  a.x.  =  0, 1, 2, . .  .(p-1)  mod  p 
i-1  1  1 

be  completely  confounded  with  blocks,  then  what  are  the  conditions  of 
estimability  of  the  following  CEL  I 

a,  a,  a 

A  B  2.  ..N  n/W./Y ,/.  .  .Z 
'  y  k'  m 

n 

defined  by  2  a.x,  =  0, 1 , 2.  .  . (p- 1)  mod  p/xj  *  s  q2/« . .  Am  “  qm 

i=l 
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if 


where  the  a/s,  W,  Y, . . .  Z  and  j,k, . . .  m  meet  specifications  given  in  (2). 
The  problem  can  be  extended  to  cases  in  which  two  or  more  classical 
effects  or  interactions  and  their  generalized  interactions  are  completely 
confounded.  The  problem  reduces  to  the  fact  that  there  are  pn’8  blocks 
each  containing  pB  treatment  combinations  and  one  is  interested  in  finding 
a  contrast  among  pn"*“l  treatment  combinations  such  that  the  contrast  is 
orthogonal  to  pn“*  blocks. 

The  approach  to  the  problem  here  will  be  to  develop  rules  for  rapid 
examination  of  the  estimability  of  a  given  conditional  effect  or  interaction 
under  classical  confounding  based  on  simple  mathematical  manipulation. 
The  theorems  and  proofs  of  the  results  are  completely  omitted. 

In  this  problem  we  have  three  types  of  effects  or  interactions: 

(i)  Confounded  effects  or  interactions  and  their  generalized  interactions, 

(ii)  Conditioned  effect  or  interaction, 

(iii)  "Conditions"  (effects  used  as  conditions). 

Each  of  the  effects  can  be  represented  by  their  respective  coordinates  of 
the  factor  space. 

Let  r  be  a  finite  set  of  coordinates  in  the  n-dimensional  factor  space, 
r  =  {Xl,x2,...xn)  . 

Let  p  be  a  subset  of  T  containing  the  coordinates  associated  with  the 
confounded  effects  or  interactions 

where  j,  k,  m  =  1 , 2, .  . .  n  and 

P  -  P1Up2U...Upr 

where  ^'s  are  subsets  containing  coordinates  of  the  ith  confounded  effect 
or  interaction  out  of  r  such  confoundeded  effects  or  interactions,  p  is  then 
the  union  of  the  coordinates  of  r  confounded  effects  or  interactions  and 
their  generalized  interactions. 

Let  y  be  a  subset  of  T  containing  the  coordinates  associated  with  the 
conditioned  effects. 


y  =  {x.,x,,..  .x,.}  . 

»  j  **. 

Let  II.  be  a  subset  of  T  containing  the  coordinates  associated  with 
the  "conditions" 

ni  s  •*xk) 

where  -  Oj  ■  H^Un^.  *  *  U-II^r 

if  there  are  r  conditions  associated  with  the  conditional  effect  or 
interaction. 

Now  by  the  application  of  simple  rules  of  set  operation  we  derive  the 
following  new  quantities,  in  three  steps: 

Step  1.  5  =  pUy 

step  2.  n2  *  pUy 

Step  3.  6Q» 

The  conditions  of  orthogonality  to  pn_s  blocks  are  as  follows* 

(i)  If  6q«  6 

then  the  conditional  effect  or  interaction  is  not  orthogonal 
to  blocks 

(ii)  If  60/  6 

then  the  conditional  effects  and  interactions  under 
consideration  are  orthogonal  to  blocks  and  consequently 
e  stimable . 

Examples:  - 

4 

El.  Consider  2  case  and  confound  ABC,  BCD  and  their  generalized 
interaction  AD.  Then  the  conditional  effect  A/Cq  is  orthogonal  to  blocks, 
sinre  T  a  {x^ *2» *3. *4} .  P1  =  {x^x^x^,  P2  =  {x2,x3,x4},  P3  =  {x^x^, 

P  =  P1'-,P2'JP3  =  {x1,x2,x3lx4},  y  =  {Xj},  s  ,rx3},  5  =  {x1>(x2,x3,x4}, 

n2  a  {x1},  6q  =  (nLUn2)  fl  6  =  {xj.x3}  /  6. 
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£2.  Consider  the  same  case  as  in  El,  the  conditional  interaction 
AB/Cj/Dq  is  not  orthogonal  to  blocks  and  consequently  not  estimable 

Lccauoc  P  -  {x1>x2,x3,x4},  y  «•  {x^x^,  1^  *  {x3,x4>,  6  *  {x^ x2# x3> x4> , 

H2  *  {xl,x2},  «  {x1,x2,x3,x4}0{x1>x2iV3,x4}  and  by  the 

application  o£  the  idempotency  law,  6  ^  ■  6 . 

E3.  Consider  the  case  3^  and  confound  the  four  factor  interaction 
ABCD,  then  the  polynomial  conditional  interaction  A'ByC^/E^  is 

orthogonal  to  blocks  because  p  *  {xj,x2.x3,x4},  y  a  {x^x^},  * 

{x1,x2,x3,x4},  n2  •  {xj, *2) i  sQ  *  (n1un2)0s  *  {xl,x2,x3}  f  5. 

The  construction  phase  of  the  factorial  experimental  designs  involving 
conditional  effects  and  interactions  will  be  presented  separately.  The 
impact  of  confounding  of  conditional  effect  and  interaction  (conditional 
confounding)  on  the  structure  of  fractional  factorials,  fractional  replica¬ 
tion  and  asymmetrical  factorials  with  balanced  and  partially  balanced 
configuration  will  also  be  presented  separately. 

INTERPRETATION  OF  FACTORIAL  EXPERIMENTS.  When  an  effect 
is  conditioned  on  another  effect,  a  conditional  effect  is  generated.  One  of 
the  examples  of  such  effect  is  the  well-known  nested  effect  in  the 
hierarchical  classification  situation.  The  conditional  effects  and  inter¬ 
actions  discussed  in  this  study  differ  from  the  nested  effects  in  that  the 
nested  effects  do  not  permit  consideration  of  reciprocal  relationship 
between  the  conditional  effect  and  its  conditions,  whereas  the  conditional 
effects  do  permit  establishment  of  reciprocal  relationship  between  the 
conditional  effects  and  its  conditions  and  do  yield  to  meaningful  interpreta¬ 
tion  when  expressed  in  its  reciprocal  form.  Consider  a  nested  effect 
Farm/Counties,  the  reciprocal  nested  effect  Counties/Farm  is  not  defined, 
whereas  a  conditional  effect  A/B  has  a  reciprocal  nested  effect  B/A  which 
is  well  defined.  It  possesses  the  property  of  commutativity  with  respect 
to  the  conditional  operator  The  conditional  effect  and  interaction  not 

only  establish  direct  relationship  between  two  or  more  effects  but  it  also 
yields  information  on  the  reciprocal  effects.  An  effect  or  interaction  is 
usually  defined  orthogonal  to  other  effects  and  interactions.  By  establishing 
direct  and  reciprocal  relationship  among  the  main  effects,  the  conditional 
effects  and  interaction  yield  very  meaningful  and  unambiguous  interpreta¬ 
tion.  By  reducing  the  higher  dimensions  to  lower  interpretable  dimensions 
the  higher  order  interaction  does  yield  informative  information  with  mean¬ 
ingful  interpretation.  All  possible  situations  cannot  be  listed  in  this  note. 

Nowlet  us  consider  a  small  experiment  in  which 
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E(4A)  a  -68;  E(2A/Bq)  =  44;  E^A/Bj)  »  -122 
E(4B)  a  224;  E(2B/AJ  m  190;  EteB/A  1  a  44 

*  u  '  1' 


E(4AB)  >  -156,  where  E  stands  for  effect  of,  and  the  numberical  value 
stands  for  the  magnitude  of  the  yield  associated  with  the  effects.  By 
examining  the  numerical  values  of  the  effects  and  the  conditional  effects 
one  can  immediately  appreciate  the  virtue  of  the  information  given  by  the 
conditional  effects.  The  E(4A)  yields  the  information  that  there  is  loss 
in  the  yield  as  one  increases  the  level  of  A,  but  the  two  conditional  effects 
following  exactly  tells  us  where  is  the  loss  and  where  is  the  gain,  meaning 
that  the  loss  associated  with  A  is  not  a  total  loss.  The  E(4AB)  give 
information  on  the  loss  associated  with  increasing  levels  of  A  or  B  or 
both,  but  the  conditional  effects  associated  with  B  clearly  define  where 
are  the  gains  and  their  exact  magnitudes.  This  is  given  here  purely  from 
the  standpoint  of  appreciation  of  the  usefulness  of  the  conditional  effects. 
The  true  use  of  conditional  effects  is  appreciated  in  systems  where  p  is 
large  and  n  is  large.  An  analysis  of  variance  on  conditional  effects  of 


this  experiment  is  given  in  the  Appendix,  Table  2.  The  Appendix  also 
contains  a  table  (Table  3)  of  effects  and  sum  of  squares  for  a  3^-experi- 


ment,  where  the  polynomial  effects  have  been  isolated.  It  is  interesting 
to  note  that  the  quadratic  effect  of  B(B")  yields  a  gain  of  56  units  in  the 


presence  of  higher  dose  of  factor  A.  The  interpretation  of  the  other  situa¬ 


tions  are  self-explanatory. 
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APPENDIX 


Table  1 

N(n,p)  and  N'(n,p) 

p/n  2  3  4 


7, 

4 

37, 

30 

175, 

160 

20, 

12 

152, 

126 

1040, 

960 

39, 

24 

387, 

324 

3471, 

3216 

Table  2 

ANOVA  2s  With  Two  Replications 

Effect 

d.f. 

S.S, 

M.S. 

F 

A 

1 

578 

578 

16.5* 

A/Bo 

1 

484 

484 

13.8* 

A/Bi 

1 

3136 

3136 

89.6** 

B 

1 

6272 

6272 

179.2** 

B/Aq 

1 

9025 

9025 

257.9** 

B/Aj, 

1 

289 

2B9 

8.3* 

AB 

1 

3042 

3042 

86.9** 

Error*(Total 

-31-4 

35 

Total  (2ar  -  1)  -  7 _ 10032 _ _ 

*  Replication  pooled  with  error. 

**Slgniflcant  at  .01  level  of  probability  of  Type  I  error. 


Table  3 


Contraata 

Sum  of  Squaraa 

Effect* 

for  38 

a 

\8 

i-1  1 

Factorial 

Sum  of  Squaraa 

A* 

-270 

6 

12,150.0 

A7B0 

-123 

2 

7,564.5 

A‘/Bj 

-118 

2 

6,962.0 

A'/B» 

-  29 

2 

420.5 

B* 

-237 

6 

11,008.2 

B7A0 

-  14 

2 

98.0 

B'/Ai 

-135 

2 

9,112.5 

B'/A8 

-108 

2 

5,832.0 

A" 

30 

16 

50.0 

AM/»0 

-  51 

6 

416.7 

A'VBi 

•  16 

6 

42.7 

A"/Ba 

-  97 

6 

1,568.2 

B" 

3 

18 

.5 

b"/ao 

-  28 

6 

130.7 

B' '/Ai 

-  25 

6 

104.2 

B"/AS 

+  56 

6 

522.7 

A'B' 

94 

4 

2,209.0 

A'B" 

84 

12 

588.0 

A"B' 

148 

12 

1,825.3 

A"B" 

78 

36 

169.0 

[■•-V  v 


jfTy& 
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THE  NEGATIVE  BINOMIAL  DISTRIBUTION  APPLIED 
TO  ATMOSPHERIC  PARAMETERS 


_  T' - , 

V*4%«**  QOCUWUilgVl 

Aerophysics  Branch,  Physical  Sciences  Laboratory 
Research  &t  Development  Directorate 
U.  S.  Army  Missile  Command,  Redstone  Arsenal,  Alabama 

ABSTRACT.  The  negative  binomial  distribution  can  be  very  helpful 
for  determining  wind  speed  and  wind  shear  frequency  distributions.  The 
derivation  cf  frequency  distributions  of  vector  wind  shear  data  for  small 
shear  increments  (100  and  50m)  from  existing  common  radiosonde  data 
with  l  km  altitude  intervals  is  explained.  The  frequency  distributions  for 
smaller  shear  increments  differ  drastically  from  "scaled  down"  distribu¬ 
tions.  Considerable  error  for  engineering  evaluation  would  be  introduced 
if  the  shape  change  of  the  negative  binomial  distribution  with  the  shear 
increment  through  the  change  of  the  mean  and  sigma  were  neglected. 

Finally  computations  of  the  cumulative  90,  95  and  99%  exceedance 
thresholds  for  wind  speed  and  wind  vector  shear  by  use  of  the  negative 
binomial  and  the  bivariate  normal  distribution  are  compared  with  the 
observations.  The  analytical  values  for  employing  the  negative  binomial 
prove  best. 

I.  INTRODUCTION.  Although  the  negative  binomial  distribution  (NBD) 
has  been  known  to  statisticians  far  a  long  time,  applications  in  atmospheric 
physics  are  not  very  wide  spread.  This  can  be  explained  for  the  following 
reasons. 

After  early  discussion  by  Pascal  and  Fermat  [see  Todhunter,  1]  one 
can  find  largely  two  versions  of  interpretation.  Greenwood  and  Yule  [2] 
assume  that  the  events  are  mutually  independent,  but  the  intensity  varies 
from  individual  to  individual  event.  Polya  [3]  and  Eggenberger  [4]  interpret 
that  the  events  are  statistically  dependent,  i.e.  the  occurrence  of  one  event 
increases  the  probability  that  further  events  will  occur. 

In  the  latter  sense  applications  have  been  attempted  mainly  for  distribu¬ 
tion  of  precipitations  or  runs  of  days  with  or  without  precipitation  [  see 
Wanner,  5,  6]  .  As  has  been  pointed  out  by  the  author  [7]  ,  applications  to 
the  continuous  frequency  distribution  of  precipitation  prove  to  be  a  problem. 
Therefore  Thom  [8]  has  suggested  the  use  of  the  incomplete  gamma  func¬ 
tion.  Recently  the  model  of  the  Markov  chain  [see  Caskey,  9  and  Weiss,  10] 
has  been  more  successfully  applied.  Thus  utilization  of  the  negative 
binomial  for  the  field  of  precipitation  appears  to  be  very  limited. 


In  the  sense  of  Greenwood  and  Yule's  interpretation  the  NBD  may 
apply  for  wind  and  wind  shear.  One  would  expect  from  other  theoretical 
background,  however,  that  the  wind  vector  follows  a  bivariate  normal 
distribution  (the  components  being  normally  distributed).  Then  the  non 
central  chi-square  distribution  should  adequately  describe  the  distribu¬ 
tion  of  the  scalar  wind  speed. 

The  non  central  chi-square  distribution,  however,  does  not  fit  extreme 
values  very  well,  especially  for  wind  shear  distributions  of  smaller 
increments  [Easenwanger,  11]  .  Thus  one  may  attempt  to  fit  the  empirical 
distribution  with  the  NBD,  as  is  later  demonstrated.  An  earlier  applica¬ 
tion  has  been  made  by  Wanner  [l 2]  ,  who  concludes  that  the  frequency 
distribution  of  the  wind  speed  follows  the  shape  of  the  NBD  the  closer  the 
higher  the  altitude  of  his  sampling  (mountain  observations).  Since  the 
present  discussion  is  mostly  concerned  with  upper  atmospheric  observa¬ 
tions,  the  employing  of  the  NBD  with  wind  data  may  be  investigated. 

II.  FREQUENCY  DISTRIBUTIONS  OF  WIND  AND  WIND  SHEAR.  As 
previously  mentioned  the  NBD  is  employed  to  describe  the  observed  fre¬ 
quency  distributions  of  wind  speed  and  wind  shear  values.  It  is  therefore 
of  vital  interest  to  ascertain  how  close  is  the  agreement  between  observed 
and  analytical  distribution.  Further,  since  the  NBD  is  a  discontinuous 
distribution,  testing  has  to  proceed  to  determine  whether  the  given  class 
division  of  the  continuous  wind  distribution  can  be  adequately  adjusted  to 
provide  fair  resemblance  with  the  NBD.  This  adjustment  is  difficult  for 
precipitation  [see  Essenwanger,  7]  . 

The  problem  is  discussed  by  the  author  in  detail  In  a  recent  report  [13]  . 
Figure  1  serves  as  an  example  to  summarize  results  for  wind  shear  distri¬ 
butions.  The  figure  displays  a  typical  wind  shear  distribution  for  1  km 
shear  intervals  (histogram  at  top  of  figure  1).  The  other  3  diagrams 
exhibit  the  deviations  from  the  observed  frequency  for  3  types  of  fitted 
curves,  the  NBD,  the  incomplete  gamma  function  with  maximum  likelihood 
fit  and  with  moments  fit.  Statistical  tests  showed  no  significant  difference 
between  these  3  fitted  curve  types  and  the  observation. 

The  NBD  was  selected  for  its  convenience  of  computation.  Since  there 
are  no  observed  data  on  the  frequency  distributions  of  smaller  shear 
increments  and  the  recommended  distribution  is  predicted,  data  for  a 
maximum  likelihood  fit  of  the  gamma  function  are  not  available.  ThuB  both 
analytical  distributions  rely  on  the  moments  fit.  It  is  immaterial  to  select 
the  negative  binomial  rather  than  the  incomplete  gamma  function.  It  will 
be  further  seen  that  observed  data  in  the  range  of  extreme  values  such  as 
threshold  exceeded  by  10,  5  and  1%  of  the  data  fit  the  observations  quite 
well  with  the  NBD. 
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III.  WIND  SHEAR  INTERVALS  OF  SMALL  INCREMENTS.  In  a  basic 
article  [14]  the  author  has  derived  that  a  relationship  between  the  mean 
■hear  v  and  the  shear  interval  AI,  caIbIo  a«  follows 

Vh)  •  ao  t4h)‘‘ 

where  and  are  constants  depending  on  climatological  conditions. 

The  Ah  denotes  the  difference  of  the  altitudes  (shear  interval),  from  which 
the  vector  shear  v  as  the  residual  of  two  wind  vectors  is  computed. 

It  has  further  been  deduced  that  a  similar  relationship  holds  for  the 
standard  deviation 

(2)  <r(ih)  =  Ao  +  BjtAh)*! 

where  Aq  arid  are  again  constants  depending  on  climatic  conditions. 

The  constant  A  can  be  determined  from 
o 

<3>  '(Ah)  =  A«  +  A1  '(Ali)  ’ 

Equation  (1)  has  further  been  confirmed  by  Armendariz  and  Rider  [15]  and 
Beimont  and  Shen  [16]  .  Although  Armendariz  and  Rider  [15]  derived  a 
similar  equation  to  (l)  for  the  standard  deviation,  which  means  A  in  (2) 
would  equal  zero,  it  is  presently  open  whether  A  approximates  zero  in 
the  ground  layers,  from  which  Armendariz1  and  Ftider's  data  are  derived, 
while  the  author  included  data  up  to  50  km  altitude  [see  Reisig  17]  .  The 
absence  of  A  may  be  further  an  effect  of  the  terrain,  as  Armendariz  and 
Rider  work  with  data  from  White  Sands,  New  Mexico,  while  the  author's 
data  were  obtained  at  Cape  Kennedy,  Florida. 

One  has  now  two  parameters,  the  mean  and  standard  deviation,  which 
can  be  utilized  to  compute  the  expected  frequency  distribution  of  shear 
values.  Figure  2  demonstrates  the  agreement  between  observed  and. 
analytical  distributions,  employing  the  NBD  for  computing  the  analytical 
model.  Five  layers  from  various  conditions  of  upper  atmospheric  shear 
distributions  have  been  selected.  The  first  3  layers  show  excellent  agree¬ 
ment  between  analytical  and  observed  data.  Some  discrepancies  are  noted 
for  15-20  and  20-25  km.  Although  the  deviations  are  statistically  not 
significant,  the  problem  of  a  distribution  with  a  better  fit  or  some  adjust¬ 
ment  to  the  fitting  procedure  iB  still  open. 


» 
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One  point  must  be  stressed,  however.  The  present  method,  employing 
the  NBD  for  describing  the  analytical  distribution  is  far  superior  to  the 
generally  practiced  technique  of  "scaling  down"  frequency  distributions  of 
vector  wind  shears,  A  typical  example  is  given  in  Table  1,  For  3  atmos¬ 
pheric  layers  a  comparison  was  made  between  analytically  derived  and 
scaled  down  distributions.  The  "scaling  down"  technique  assumes  that  the 
same  distribution  for  smaller  shear  intervals  as  for  larger  intervals 
exists,  e.  g.  the  regular  available  shear  distribution  of  1  km  intervals 
(easily  obtained  from  the  present  radiosonde  network)  would  be  divided  by 
10  to  obtain  the  distribution  of  190  m  shear  interval.  Table  1  demonstrates 
clearly  that  this  technique  is  Out  of  place  as  it  does  not  take  into  considera¬ 
tion  any  shape  and  scale  change.  The  real  distribution  produced  such 
changes,  which  are  quite  adequately  expressed  by  the  NBD.  It  is  impor¬ 
tant  to  include  these  changes  into  the  derived  frequency  distribution.  As 
becomes  quite  obvious  from  Table  1,  considerable  error  for  engineering 
evaluation  would  be  introduced  if  the  shape  changes  of  the  negative  binomial 
distribution  with  the  shear  increment  through  the  change  of  the  mean  and 
sigma  were  neglected. 

More  details  can  be  found  in  pertinent  articles  as  cited  under  13  and 
14. 


IV.  COMPUTATION  AND  COMPARISON  OF  90,  95  AND  99  PERCENT 
PROBABILITY  THRESHOLDS.  One  of  the  important  criteria  in  missile 
application  are  percentile  values  such  as  the  thresholds  of  shear  values 
exceeded  in  10  percent  of  the  cases  or  similar  tolerance  values.  Thus  it 
is  quite  reasonable  to  demand  that  the  employed  distribution  must  be 
successful  in  describing  said  thresholds.  The  90,  95  and  99%  observational 
values  were  selected  for  this  purpose. 

Three  types  of  distribution  were  tested,  the  bivariate  distribution  (BD), 
the  negative  binomial  and  the  incomplete  gamma  function  (IGF).  The  results 
for  the  NBD  and  the  IGF  were,  however,  similar  and  showed  no  statistically 
significant  or  obvious  differences.  Thus  the  comparison  between  NBD  and 
IGF  may  be  omitted  here. 

a.  Computation  of  the  Threshold  for  the  Bivariate  Distribution. 


The  computation  of  the  90,  95  or  99%  value  for  the  bivariate 
distribution  is  quite  cumbersome.  One  has  to  solve  the  following  type  of 
integral 


(4) 


V)* 


I 


v  f,  .dv 

(v) 
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where  denotes  the  probability  level  of  the  threshold,  the  thresh¬ 

old  value  of  the  wind  «nA»r|  v  the  wind  speed  and  f^  the  distribution 

function,  in  this  case  the  bivariate  normal  distribution.  A  similar  equation 
exists  for  the  wind  shear,  replacing  v  by  the  pertinent  parameter  for  the 
wind  shear. 


The  solution  is  complicated,  but  can  be  approximated  by  the 
cumulative  distribution  of  the  non  central  x^  distribution  or  by  determining 
an  offset  circle  of  the  bivariate  distribution  (see  Id).  In  the  present 
application,  one  has 


and  1  <  R  <  5.  Thus  the  equation  for  approximation  is  transformed  for 
obtaining  R  in  explicit  form: 

.1  3 

-2  r.  2  ,1+b,  .  2  ,1+b.  2 


where 


r,  &  liTD|  &  /ATP. 

R  =a(l  -,{—)  +  -  {— )  C] 


C  =  [■(£«  P  +  in  /27)  ]  * 


a  -  2  +  r 


vr  denotes  the  resultant  wind  vector  or  the  equivalent  for  the  shear.  The 
solution  presents  no  problem  when  employing  a  high  speed  electronic 
compute  r. 

A  simplified  approach,  provides  the  same  correlation  coefficient, 
although  in  the  winter  months  the  average  threshold  is  slightly  higher  than 
the  observed  value.  This  can  be  based  upon  the  following  assumption. 


The  mean  wind  speed  v  can  be  computed  from 


-  y 


v  f,  ,  dv  . 

w 


226 


If  one  considers  that  the  C  is  taken  from  the  circular  normal  distri- 

o 

bution,  where  <r  *  <r  =  <r  ,  then  the  high  correlation  between  observed 
x  y  v  “ 

and  computed  values  of  the  90,  95  and  even  99%  as  later  shown  is 
remarkable. 


TABLE  2 


b.  Computation  of  the  Threshold  for  the  Negative  Binomial  Distribu- 

tion. 

The  computation  of  the  threshold  value  for  the  negative  binomial 
is  also  based  on  a  solution  of  the  integral  (3)  as  before  but  this  time  the 
f^  is  the  negative  binomial  distribution  and  the  integral  is  one -dimensional. 

Similar  explicit  formulae  as  for  the  bivariate  distribution  are  presently 
not  available.  One  can  convert  the  cumulative  NBD,  however,  into  the 
incomplete  beta  function.  This  was  pointed  out  by  Pearson  and  Fieller  [19]  , 
or  rediscovered  by  Patil  [20]  and  was  recently  discussed  by  Bartko  [21,  22]  . 
The  90,  95  and  99%  values  can  then  be  obtained  from  the  tables  of  the 
incomplete  beta  function  [23]  .  The  procedure  is  somewhat  elaborate,  but 
does  not  involve  computations  of  the  cumulative  distribution  by  electronic 
computer.  It  was  performed  to  obtain  the  necessary  analytical  values  for 
comparison  v/ith  the  observed  threshold. 

Although  the  maximum  likelihood  fit  could  have  been  utilized  by 
employing  the  frequency  distributions  and  finding  solutions  to  the  maximum 
likelihood  equation  (Haldane,  [24]  and  cited  by  Bartko,  [22]  ),  the  moments 
method  for  parameter  estimation  was  employed  for  the  following  reasons. 

One  of  the  goals  is  the  derivation  of  distributions  for  small  shear  intervals, 
for  which  the  frequency  distribution  is  not  known.  Thus  the  information 
necessary  for  the  maximum  likelihood  fit  is  not  available,  while  the 
parameters  for  the  moments  fit  can  be  computed.  If  the  NBD  with  moments 
fit  would  therefore  give  a  poor  result  for  computation  of  the  threshold  values, 
the  NBD  could  not  be  used  without  first  developing  a  prediction  scheme  for 
the  information  needed  for  maximum  likelihood  fit.  Thus  the  question  of 
maximum  likelihood  fit  is  of  secondary  importance  for  this  particular 
problem. 


When  using  the  tables  of  the  incomplete  beta  functions  [23]  ,  the 
parameters  p,  q  and  the  scale  parameter  b  must  be  known.  They  have  been 
obtained  from 


(17) 


p  =  2 


<r 


(18) 


q 


-2 


or 
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where  is  the  third  moment  with  reference  to  x,  the  mean  and  <r  the 

variance1*1.  The  pertinent  parameters  for  wind  and  wind  shear  have  to  be 
introduced  into  equations  (17)  thru  (19). 

The  threshold  value  then  becomes 

(20a)  v ^  a  b  (l  -  x^)  or 

(20b)  v^  *  b 

■ 

depending  on  whether  q  >  p  (then  20a)  or  q  <  p  (then  20b). 

c.  Comparison  of  the  Computed  Thresholds  with  the  Observed  Values. 

The  threshold  values  of  90,  95  and  99%  were  computed  for  wind 
speed  and  wind  shear  for  several  stations  and  compared  with  the  respective 
observed  values.  The  latter  were  obtained  from  a  computer  program, 
listing  certain  thresholds  of  the  cumulative  distributions  as  begun  in  the 
Climatological  Ringbook  [25]  . 

The  differences  between  the  computed  and  observed  thresholds 
could  have  been  checked  with  the  Chi-square  test  for  statistical  significance 
of  the  deviations.  Since  the  computed  values  were  close  to  the  observed 
thresholds,  another  tool  of  comparison  has  been  employed.  It  was  obvious 
from  randomly  selected  samples  that  the  chi-square  test  would  not  render 
statistical  significance  for  most  of  the  deviations  of  the  computed  thresh¬ 
old  from  the  observed  values.  Thus  the  correlation  coefficient  was 


‘'‘Footnote:  The  p,  for  the  negative  binomial  distribution  is  known,  when 

-  ^  2  -  2 
the  x  and  the  <r  are  known;  p  a  x  (l  +  3d  +  2d  ),  where 

cr2 

d  +  1  =  . 

x 


4 

i 
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utilized,  which  cannot  only  give  information  about  the  agreement  between 
theory  and  observation,  but  can  also  delineate  a  systematic  bias,  if  the 
iiiKsatio  iti  tliv  analytical  ami  uboerveu  threshold  differ, 

The  correlation  coefficients  are  contained  in  Tables  3-5  for  the 
data  of  Montgomery,  Alabama  as  a  typical  example  of  the  results.  It  is 
evident  from  the  tables  that  the  correlation  is  very  high  and  therefore  the 
analytical  values  are  very  close.  However,  a  detailed  inspection  of  the 
coefficients  shows  that  there  are  some  differences.  First  one  notices  that 
the  coefficients  display  a  slight  tendency  to  decrease  towards  the  99% 
threshold.  Thus  the  analytical  values  appear  to  fit  less  towards  the 
extreme  Values,  Further,  this  tendency  to  decrease  is  more  pronounced 
for  the  bivariate  fit  than  for  the  negative  binomial  and  more  for  the  wind 
shear  than  for  the  wind  speed.  This  result  is  not  unexpected.  The 
tendency  of  deviations  from  the  bivariate  distribution,  especially  for  wind 
shears,  has  been  pointed  out  by  the  author  in  an  earlier  article  [l  l]  . 
Further,  the  analytical  method  for  the  bivariate  distribution  approximates 
the  thresholds  by  either  using  mean  wind  speed  only  as  in  equation  (l6) 
or  basing  it  on  the  circular  distribution  for  equation  (6).  The  method 
employing  the  negative  binomial  avoids  these  problems.  Besides  the  mean, 
the  variance  of  the  distribution  is  needed,  and  in  our  particular  case  the 
variance  of  the  wind  speed  and  wind  shear.  By  fitting  the  incomplete  beta 
function,  even  the  third  moment  could  be  included,  which  is  a  3  para¬ 
meter  fit.  Thus  the  basis  for  analytically  determining  the  thresholds 
comprises  more  or  better  parameters  for  the  negative  binomial  approach. 
The  result  confirms  this.  The  analytical  thresholds  agree  better  with  the 
observed  ones  for  the  negative  binomial  method. 

Whether  there  is  a  bias  between  the  computed  and  observed  thresh¬ 
olds  can  be  answered  from  Table  6,  where  typical  examples  for  the  wind 
speed  thresholds  are  displayed.  The  results  for  the  negative  binomial 
distribution  look  generally  good,  although  there  is  a  slight  tendency  towardB 
a  lower  average  than  the  observed.  But  the  result  may  be  considered  within 
the  tolerance  limits  of  errors.  The  scatter  for  the  analytical  values 
around  the  average  v.  ■  expressed  by  the  standard  deviation  or  is  the 

same  as  for  the  observed  values,  denoted  by  <rQ.  This  confirms  the 
closeness  of  the  computed  results  in  addition  to  the  high  correlation 
coefficient. 

The  averages  for  the  bivariate  distribution  also  agree  very  well, 
thus  no  systematic  large  bias  is  visible.  It  is  noticed,  however,  that  the 
scatter  represented -by  cra  is  higher  than  the  scatter  for  the  observed 
data.  This  indicates  that  not  all  of  the  computed  values  have  good  agree¬ 
ment,  a  conclusion  already  stated  above  in  the  consideration  of  the 
correlation  coefficient, 
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Table  4 


LINEAR  CORRELATION  COEFFICIENT  FOR  COMPARISON  OF 
OBSERVED  AND  ANALYTICALLY  DERIVED  95*  LEVEL 


Bivariate 

Neg.  Binomial 

Bivariate 

.980 

.996 

.940 

.981 

.992 

.863 

■  993 

•  997 

.950 

.991 

.997 

•  972 

.803 

.906 

.986 

.809 

.898 

.966 

.99* 

•  997 

.972 

.990 

.997 

•931 

Wind  Shear 


Neg.  Binomial 


Average 


Average 


Table  5 


LINEAR  CORRELATION  COEFFICIENT  FOR  COMPARISON  OF 
OBSERVED  AND  ANALYTICALLY  DERIVED  99 $  LEVEL 


Montgomery 


Wind  Speed 

Wind  Shear 

Bivariate 

Neg.  Binomial' 

Bivariate 

Neg.  Binomial 

.980 

■995 

.868 

.936 

•  975 

■  990 

.721+ 

.925 

.986 

.996 

.785 

.888 

.978 

•  -998 

•  930 

•  975 

•  752 

.919 

.932 

.980 

.680 

.929 

.854 

.958 

* 

.976 

.992 

.860 

.902 

.980 

.997 

.728 

.9^3 

.913 

•  977 

.838 

.938 

Table  6 


Comparison  of  Computed  and  Observed  Thresholds  ( Summary) 

( a)  bivariate 


Station 

Threshold 

N 

\ 

■9 

n 

r 

El  Paso 

90* 

224 

24,4 

25.8 

14.1 

12.6 

.984 

95 

224 

27*9 

27.2 

16.1 

14.2 

.970 

99 

224 

34.6 

33.9 

20.0 

16.4 

.929 

Chateauroux 

90 

248 

24.1 

25.5 

U.9 

12.2 

.970 

95 

248 

27.6 

29.5 

13.5 

l4 . 2 

.967 

99 

248 

34.2 

37.8 

16.8 

17.2 

.941 

Montgomery 

90 

572 

2  6.6 

25.6 

18.3 

15.3 

.984 

95 

372 

30.3 

29.2 

21.1 

16.8 

.970 

99 

372 

37.6 

36.2 

26.1 

19.5 

.951 

( b)  negative  binomial 


Montgomery 

90 

372 

24.9 

25.6 

15-3 

15.3 

..992 

95 

372 

28.2 

29.2 

16.8 

16.8 

.993 

99 

372 

34.4 

36.2 

19.5 

19.5 

.992 

v.  mean  wind  speed  of  computed  threshold  (m/sec) 
L 

V.  mean  wind  speed  of  observed  threshold  (m/sec) 
L 

og  standard  deviation  of  analytical  values 

a  standard  deviation  of  observed  values 
o 

r  correlation  coefficient 


One  may  think  about  other  distribution  functions  as  being  more 
suitable  for  deriving  analytical  values  such  as  the  Weibull  distribution 
T26]  .  The  negative  hinomial  distribution,  however,  describes  the 
thresholds  already  satisfactorily  and  preliminary  computations  with  the 
Weibull  distribution  did  not  render  better  results  rather  than  thresholds 
in  the  middle  between  the  bivariate  and  negative  binomial  method.  Besides, 
it  is  very  difficult  to  objectively  determine  the  location  parameter  for  the 
Weibull  distribution,  and  thus  the  negative  binomial  distribution  offers  an 
advantage  in  the  estimation  of  parameters.  Under  these  circumstances 
the  question  of  determining  the  thresholds  based  on  the  Weibull  distribution 
is  not  further  pursued  for  this  report. 

V.  SUMMARY  AND  CONCLUSIONS.  It  has  been  demonstrated  that 
the  negative  binomial  distribution  has  its  place  in  problems  of  atmospheric 
physics,  especially  in  missile  climatology  for  wind  speed  and  wind  shear 
distributions.  For  this  purpose  the  NBD  serves  largely  as  a  practical 
and  convenient  tool  for  describing  the  frequency  distribution.  Especially 
the  application  to  derive  realistic  frequency  distributions  of  wind  shear  for 
small  increments  is  important.  This  technique  is  far  superior  to  the 
general  practice  of  scaling  down  wind  shear  distribution  for  larger  intervals 
which  are  commonly  available.  The  utilization  of  the  NBD,  however,  can 
accommodate  the  change  of  shape  of  the  distribution  with  the  shear  interval, 
a  property,  which  the  scaling  down  neglects,  Considerable  error  for 
engineering  application  may  arise  if  this  shape  change  is  overlooked. 

It  haB  further  been  discussed  in  detail  that  the  NBD  can  also  be  useful 
in  deriving  threshold  values  for  the  cumulative  90,  95  and  99%  levels,  if 
mean  and  variance  for  the  distribution  are  known.  Comparison  between 
analytically  derived  and  observed  thresholds  displayed  excellent  agreement 
without  bias.  The  method  proved  superior  to  the  application  of  the 
bivariate  normal  distribution  for  the  same  purpose.  The  only  advantage 
for  the  latter  practice  could  be  the  possibility  of  establishing  a  relationship 
between  the  threshold  value  and  the  mean,  aB  expressed  in  equation  (16). 

In  this  relationship  one  parameter,  the  mean  only,  needs  to  be  known. 

This  simplifies  the  computation  of  statistical  parameters  and  increases 
the  use  of  numerous  data  collection,  in  which  the  mean  only  is  given. 

The  conversion  of  the  NBD  for  the  use  of  the  tables  of  the  incomplete 
beta  function  [23]  to  obtain  the  pertinent  threshold  values  has  been 
described.  The  need  for  knowing  the  third  moment  does  not  introduce 
a  new  condition,  as  the  P3  for  the  NBD  is  known  with  given  rAean  and 
variance.  Making  use  of  three  parameters,  however,  points  to  the 
possibility  of  utilizing  the  incomplete  beta  function  for  the  curve  fitting, 
although  the  third  moment  113  then  must  be  computed  from  the  observa- 
tions  to  offer  some  advantage.  Utilization  of  two  parameters,  mean  and 
variance,  is  sufficient  only  for  the  NBD. 
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Another  3  parameter  fit  would  be  the  Weibull  distribution.  Prelimi¬ 
nary  computations  did  not  produce  better  results,  however,  and  therefore 
no  detailed  discussion  and  comparison  were  included  in  this  report. 

The  NBD  has  therefore  a  definite  place  among  the  statistical 
distributions  useful  for  application  to  atmospheric  parameters, 
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TRIAL  VARIABILITY  INTERPRETED  AS 
DIFFERENCES  IN  TRANSLATION  OR  ROTATION  IN 
FUNCTION  ANALYSIS  OF  VARIANCE 

Walter  D.  Foster 

U.  S.  Army  Biological  Laboratories 
Fort  Detrick,  Frederick,  Maryland 

ABSTRACT.  Referee  experimentation  connotes  in  general  a  set 
of  participants  performing  the  same  experiment  under  nearly  identical 
circumstances.  Variance  analysis  of  results  often  takes  the  form  of 
Between  Stations  and  Within  Stations.  As  a  device  for  interpreting  the 
magnitude  of  the  mean  square  for  repeated  trials  at  a  station,  the  mean 
square  is  converted  to  a  corresponding  vertical  change  in  centroid 
(translation)  or  to  a  change  in  slope  (rotation).  The  variable  of  analysis 
is  a  multiple -parameter  function  representing  decay. 


The  concept  and  practice  of  the  Analysis  of  Variance  when  the  response 
variable  is  a  function  rather  than  a  single  value  was  given  by  Foster  [l] 
in  1962.  Comparison  of  this  technique  to  the  multivariate  analysis  of 
variance  was  given  by  Foster  [2]  in  1963.  Brownlee  [3]  showed  how  to 
make  simultaneous  tests  of  slopes  and  centroids  if  the  response  is  a  linear 
function  with  two  parameters.  Church  [4]  gave  the  partition  of  variance 
for  a  factorial  experiment  for  each  parameter  of  a  curvilinear  model  when 
used  as  the  response  variable. 

The  development  in  this  paper  is  described  in  terms  of  its  application: 
referee  experimentation.  Referee  (or  collaborative  or  standardization)  ex¬ 
periments  consist  basically  of  several  independent  laboratories  performing 
the  same  experiment  in  nearly  identical  circumstances.  The  simplest  case 
compares  laboratories  (or  stations  as  they  are  referred  to  here)  using 
repeated  trials  at  each  station  as  the  criterion  --  the  standard  Between  and 
Within  analysis  of  variance.  A  more  sophisticated  design  would  introduce 
a  range  of  treatments  in  order  to  estimate  a  Station  X  Treatment  effect. 
Thus,  the  two  major  objectives  of  a  referee  experiment  are  the  comparison 
of  stations,  treatment  means  and  the  estimation  of  reproducibility  at  each 
station.  When  each  trial  is  a  biological  aerosol  produced  in  a  closed 
chamber  and  allowed  to  settle,  the  response  is  the  decay  function  which 
describes  the  loss  of  biological  activity  with  time; 

C  =  CQ(t  +  i)*Vkt 


la  the  decay  model  chosen  for  this  analysis.  The  comparison  of  stations, 
treatments  and  S  x  T  was  given  by  Foster  [l]  .  It  is  the  purpose  of  this 
paper  to  examine  the  mean  square  for  repeated  trials  at  a  station  which 
was  used  as  a  measure  of  reproducibility  and  to  translate  this  variance 
whose  magnitude  is  generally  meaningless  to  the  experimenter  into  a 
familiar  scale  to  facilitate  subjective  appraisal  and  evaluation  of  repro¬ 
ducibility. 

Using  the  techniques  of  multiple  regression,  the  data  for  a  single  trial 
of  n  points  can  be  represented  by 

In  C  e  in  -  b  ln(t  +  1)  -  kt 


where  In  C  ,  b,  and  k  are  estimated  by  least  squares.  Partition  of  the 
o 

variation  in  the  analysis  of  variance  format  is  given  in  Table  I,  using 
Snedecor's  [5]  notation. 


TABLE  I 

.  A.  V.  for  a  Single 

Trial 

Line 

Source 

df 

SS 

1 

Function 

3 

2 

Co 

1 

(SY)2/n 

3 

b,  k 

2 

bSXjy  +  kSx2y 

4 

Deviations 

n-3 

2 

Sy  -  bSXjy  -  kSx2y 

5 

TOTAL 

n 

SY2 

Note:  Xj  = 

ln(t  +  1)  ;  X2  =  t  ; 

Y  =  In  C 

xl  = 

Xi-Xl  ;  x2=X2- 

■  *2  •'  y  =  Y  -  Y 

For  t  trials  at  a  station,  the  analysis  of  variance  of  the  decay  curve, 
showing  partition  and  corresponding  sums  of  squares  is  given  in  Table  II. 
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cv  TABLE  II.  A.  V.  for  t  Trials 


Line 

6 

Source 

Mean 

d i 

3 

In 

(s 

1 

Y)2/tn  + 

SR 

tn  tn 

b  S  x.y  +  k  S  x  y 
1  l  2 

7 

Among  trials 

3(t 

-1) 

t 

E 

1 

Line  1^  - 

Line  6 

8 

Deviations 

t(n 

-  3) 

t 

2 

1 

Line  4 

9 

TOTAL 

tn 

t 

2 

Line  5 

¥  • 

1 


When  the  mean  decay  function  for  a  station  is  compared  to  those  of 
other  stations,  the  comparison  is  both  visual  and  objective  --  visual  because 
the  functions  can  be  graphed  and  their  parameters  tabled;  objective  because 
a  test  of  significance  is  available  [l]  ,  but  not  given  here.  Thus,  the  com¬ 
parison  of  means  is  complete  and  in  a  scale  meaningful  to  the  participants. 
Comparisons  of  trial  M  S  for  the  various  stations  can  also  be  done  statisti¬ 
cally,  but  the  mean  square  itself  has  little  meaning  to  the  experimenter. 

Two  strategems  involving  translation  and  rotation  in  the  original  scale 
are  presented  as  a  method  of  interpreting  the  magnitude  of  the  trial  mean 
square,  Since  most  aerobiologists  are  thoroughly  familiar  with  the  simple 
exponential  function, 

_  -kt 

C  =  C  e 
o 

as  a  decay  model,  the  trial  mean  square  has  been  scaled  into  translations 
of  Cq  and  into  rotation  of  k.  The  technique  is  simple. 

Let  the  experimenter  visualize  the  trial  variability  as  being  expressed 
by  two  parallel  lines,  the  plot  of 

In  C  =  In  C  -  kt 
o 


whose  vertical  separation  or  translation  is  equivalent  to  the  trial  variability. 
Obviously,  the  greater  the  variability,  the  greater  the  distance  between  the 
two  parallel  lines.  He  thus  may  consider  his  trial  variability  as  if  he  had 


run  only  two  aerosols  with  equal  decay  rates  but  displaced  starting  points 
(intercepts). 

Algebraically,  the  displacement  or  translation  is  derived  by  consider' 
ing  the  same  partition  of  the  trial  decay  functions  in  Table  II  with  only  two 
trials.  This  is  shown  in  Table  III.  The  notation  has  the  form  of 

Y  =  ?  -  b  (X  -  x)  . 

TABLE  HI.  Development  of  Trial  Variability  as  Translation 


Line 

Identification 

SS 

10 

SS  Function  1: 

n  Y2  +  bSxy 

11 

SS  F unction  2: 

n  Y2  +  b  Sxy 

12 

Line  10  4  11: 

n(Y*  +  Y^)  +  2b  S  xy 

13 

Mean  Function; 

2n[  ( Yj  +  Y2)/2]  2  +  2bSxy 

14 

Line  12  -  Line  13: 

-  52>7 2 

The  Mean  Square  corresponding  to  the  sum  of  squares  in  Line  14  is  simply 

n<?v  -  Y2)2/4  • 

Upon  equating  the  observed  trial  mean  square  to  the  derived  translation 
and  solving  for  the  translation,  we  have 


Y  -  Y 
1  2 


4  MS  Trials/n  , 


which  as  a  distance  applies  to  the  intercepts,  In  C  ,  as  well  as  to  the 
centroids  because  of  the  assumed  parallelism. 

The  following  example  of  six  trials  at  a  station  illustrates  the  use 
of  this  technique. 
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Function  Analysis  of  Variance 


L,ine 

Source 

df_ 

MS 

15 

Mean 

3 

2427.4869 

16  . 

Among  Trials 

15 

.  1975 

17 

Deviations 

30 

.0878 

V72 

=  V4(.  1975)/8  = 

.  314 

in  In  C  scale, 

or  a  1.  37  fold  (antiln  .  314)  effect. 

Had  the  trial  mean  square  been  .  960  the  translation  would  have  been 
Y  -  Y,  -  V  4(.  96)/8  =  .  693  or  a  2.  0  fold  effect. 

1  b 


Interpreted  to  the  aerobiologist,  trial  variation  of  this  magnitude  (MS  = 

,  1975)  implies  that  his  ability  to  reproduce  an  aerosol  is  no  better  than 
1.  37  fold.  It  should  be  noted  in  passing  that  the  translation  concept  is 
applicable  to  any  decay  function  under  the  requirement  of  parallelism, 

The  second  approach  to  relate  trial  variability  to  experience  is  by 
rotation,  i.e.  ,  a  change  in  the  slope  of  the  linear  decay  function;  in  this 
case  it  refers  to  a  change  in  the  parameter  k.  The  centroids  for  each  of 
two  lines  are  required  to  be  identical;  the  MS  for  trial  variability  is 
equated  to  change  in  slope,  This  approach  is  more  subtle  since  changes 
in  k  effected  through  equivalent  size  of  the  mean  square  depend  upon  the 
domain  of  the  independent  variable  and  the  particular  design.  For  a  large 
domain  the  change  in  k  will  be  small;  for  a  narrow  interval,  the  change 
will  be  large  (because  the  variance  of  slope  is  proportional  to  l/S  x2). 

The  development  is  given  in  the  following  table.  The  notation  again  has  the 
form  of 


Y  =  Y  +  b(X  -  X) 
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TABLE  IV.  Development  of  Trial  Variability  as  Rotation 


Line 

Identification 

SS 

15 

SS  Function  Is 

nY2  +  b^Sxy^ 

16 

SS  Function  2; 

n?2  +  b2  S*y2 

T7 

Line  1 5  &  16: 

2n  Y2  +  b^  Sxy^  +  b2  Sxy2 

18 

Mean  Function; 

2n  ?  +  (  2  )(Sxyi  +  Sxy2) 

19 

Line  17-18; 

(bj  -  b2)2  S  x2/2 

The  Mean  Square  corresponding  to  the  Sum  of  Squares  in  Line  19  is  simply 

(bl  -  b2)2  S  x2/4  . 

As  before,  this  quantity  is  equated  to  the  observed  trial  mean  square 
and  the  amount  of  rotation  is 

-  b2  =  V4(.  197 5)/l.  66  X  103 

=  .022  . 

Note  that  the  apparently  small  change  in  slope  is  due  to  the  extremely 
large  domain  of  t,  1300  minutes.  While  the  concept  of  translation  was 
applicable  to  any  decay  function,  the  rotation  approach  required  a  linear 
model  for  its  straight-forward  interpretation  as  a  change  in  a  single 
parameter. 
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A  METHOD  FOR  ADJUSTING  FOR  PARTICLE  SIZE  AND  MATRIX 
EFFECTS  IN  THE  X-RAY  FLUORESCENCE  ANALYSIS  PROCEDURE 
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Virginia  Polytechnic  Institute,  Blacksburg,  Virginia 
Donald  Womeldorph 

Phillips  Petroleum  Company  ! 

Bartlettsville ,  Oklahoma 

X-Ray  fluorescence  methods  have  been  widely  used  in  the  analysis  of 
multicomponent  mixtures.  The  advantage  is  due,  of  course,  to  the  high 
speed  and  precision  of  the  method.  It  is  unfortunate,  however,  that,  one 
is  not  always  able  to  attain  accurate  analyses  in  practice  because  of  the 
existence  of  sample  matrix  effects  and  particle  size  effects. 

Existence  of  matrix  effects  implies  that  the  intensity  of  fluorescent 
radiation  from  the  analytical  element  is  a  function  of  the  concentration  of 
the  matrix  elements  as  well  as  its  own  concentration.  This  phase  of  our 
problem  has  been  discussed  by  several  workers.  Mitchell  [7]  describes 
the  problem  in  elaborate  detail.  In  a  recent  paper,  Alley  and  Myers  [l] 
discuss  ways  of  using  inverse  estimation  in  linear  regression  to  account 
for  these  effects.  Also,  Campbell  and  Brown  '{ 3 j  have  reviewed  mathe¬ 
matical  and  empirical  methods. 

The  consideration  of  particle  size  of  the  components  is  extremely 
important  in  X-Ray  fluorescence  analysis  for  the  case  of  granular  materials. 
In  fact,  variations  in  particle  size  of  the  materials  can,  in  some  cases, 
have  a  greater  effect  on  the  X-Ray  intensity  than  variations  in  concentra¬ 
tion.  The  fluorescent  X-Ray  intensity  is  affected  by  both  the  fluorescent 
and  matrix  component  particle  sizes  and  their  relative  concentrations  in 
the  sample.  Claisse  and  Sampson  [4]  ,  and  Bernstein  [2]  discuss  the 
particle  size-intensity  relationship. 

This  paper  describes  the  use  of  a  procedure  involving  estimation  in  a 
statistical  functional  relationship  to  approximate  the  structural  form  that 
exists  between  the  X-Ray  intensity  of  each  component  and  the  concentra¬ 
tion  of  all  of  the  components  in  the  mixture.  The  non-linear  functional 
relationships,  which  include  the  effects  of  measurement  errors,  permit  the 
estimation  of  component  concentrations  in  unknowns  over  wide  ranges  at 
constant  particle  size  by  using  data  obtained  from  the  analyses  of  a  series 
of  calibration  mixtures  having  the  same  particle  size.  Methods  are  also 
shown  for  estimating  the  concentrations  of  components  in  mixtures  at  any 
other  combination  of  particle  sizes  by  analyzing  only  one  additional 


calibration  mixture  having  the  new  particle  size  combination.  This 
ouuoiantiaily  reduces  in  comparison  with  conventional  procedures  the 
amount  of  experimental  work  required  to  recalibrate  when  one  or  more 
of  the  component  particle  sizes  varies  upon  changing  lots  or  batches  of 
material. 

Special  attention  is  given  with  numerical  resultB,  to  "Tichloral" 
igniter  mixtures  manufactured  by  the  U>  S.  Army  Missile  Command, 
Redstone  Arsenal,  Alabama.  These  mixtures  are  comprised  of  potassium 
perchlorate,  titanium,  and  aluminum  powders,  and  sometimes  a  small 
percentage  of  a  binder  such  as  polyisobutylene.  The  estimation  procedure 
is  presented  and  "check  samples"  of  known  concentration  (with  particle 
size  differing  from  that  of  the  calibration  data)  were  analyzed  by  the 
procedure. 

The  method  described  here  differs  considerably  from  the  usual 
multiple  regression  technique. 

THEORETICAL,  DEVELOPMENT  OF  PROCEDURE.  Lucas  Tooth  and 
Pyne  t6]  developed  a  theoretical  concentration  -  intensity  model  account¬ 
ing  for  matrix  effects.  It  is  this  model  that  serves  as  the  basis  for  our 
development  (other  models  such  as  a  "complete  quadratic"  polynomial  can 
perhaps  be  used  as  well).  This  model  can  be  expressed  as: 


(1) 


V  = 
n 


+  E 

i<j<q 


i. 

j 


+  i 


S  a(">  I. 


where  V  is  the  percentage  of  component  n  in  the  mixture;  I  is  the 
^  -  (n) 

X-Ray  intensity  for  component  j;  the  a  's  are  constant  parameters 
related  to  mass  absorption  coefficients  [3]  .  a^n)  includes  background 
intensity  when  peak  intensity  measurements  are  made.  The  subscript  (n) 
implies  that  the  parameters  are  characteristic  of  the  component,  i.  e.  , 

the  a's  describe  enhancement  or  absorption  of  the  other  components  with 
the  component.  For  example,  for  a  three  component  mixture,  we  can 
write  the  percentage  of  component  1  in  the  mixture  as: 


(2) 


V  -  J1*  + 

VI  a0 


0)r 

ai  h 


,a> 


+  Q2  +  Q 


<1>I 
3  3 


a<V 

11  1 


“S'a 


+  a 


(Di 

13  ' 


1*3 


Often  terms  beyond  those  describing  a  linear  equation  can  be  deleted  with¬ 
out  serious  consequence. 
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One  might  expect  that  a  classical  least  squares  procedure  for  estimat¬ 
ing  the  coefficients  in  c4ud.Li.ot1  (2)  wouid  be  appropriate.  Actually,  the 
papers  [8]  ,  [5]  ,  and  [l]  rely  heavily  on  this  procedure.  In  the  latter 
paper,  the  authors  use  a  linear  relationship  in  which  the  concentrations  are 
on  the  right  hand  side  of  the  equation,  while  intensity  appears  on  the  left, 
The  coefficients  are  estimated  by  least  squares  and  the  equations  {One  for 
each  component)  are  inverted  for  the  analysis  of  an  unknown.  However,  the 
particle  sizes  of  the  solid  components  were  held  constant  in  the  experi¬ 
mental  work.  If  the  particle  size  effect  is  assumed  to  vary  from  batch  to 
batch  of  raw  materials  that  are  used,  then  the  coefficients  in  (2)  would  be 
dependent  on  particle  size  and  thus  it  would  be  necessary  to  develop  a 
different  relationship  involving  different  coefficients  for  each  batch  of  raw 
materials. 


Experimental  methods  are  presented  here  for  which  the  experimenter 
can  use  concentration  -  intensity  data  under  one  particle  size  condition,  to 


determine  the  percentages  of  components  in  unknown  samples  under  a  second 
particle  size  condition. 


Assumptions  Concerning  Equation  (2).  Suppose  we  consider  the  model 
of  equation  ( 2 ),  We  shall  drop  the  subscript  on  the  coefficients  and  thus 
refer  to  the  relationship  for  component  1, 


V  a  Q 
1  0 


4  q1I1 


a12^1I2 


a13V3 


We  could,  of  course,  write  a  similar  expression  for  components  2  and  3. 

Suppose  we  consider  two  particle  size  levels,  say  I  and  2.  Suppose  we 
have  intensity  -  concentration  data  at  particle  size  2,  but  we  wish  to  esti¬ 
mate  the  coefficients  in  (2.  a)  when  the  raw  materials  are  from  a  batch  at 
particle  size  1.  It  must  be  emphasized  here  that  we  do  not  need  to  know 
what  these  particle  sizes  are;  we  simply  know  that  two  different  conditions 
exist.  We  will  assume  that  the  measured  intensity  of  component  1  at  some 
concentration  level  (V^,  V^,  V,)  and  at  particle  size  2  can  be  written; 

(3)  Xx  =  Ij  4  dj  4  fj 


and  similarly  for  components  2  and  3, 

I  is  the  "effective"  or  true  X-Ray  intensity  oi  component  1  for  the  mixture 
at  concentration  (V^,  V^,  V^),  and  at  particle  size  condition  1. 


d  is  the  particle  size  correction,  i.  e.  ,  the  constant  which  represents  the 
affect  on  the  intensity  of  the  particle  size  difference  (between  level  1  and 
level  2). 

is  the  measured  X-Ray  intensity  of  component  1  when  the  mixture  is 
composed  of  raw  materials  at  particle  size  2. 

f|  is  a  random  measurement  error  effect  on  the  intensity.  It  represents  the 
affect  of  counting  and  other  instrumental  errors. 

Further  discussion  of  and  fj  are  in  order  here,  is  considered  to  be  x 
statistical  "random"  error,  owing  to  inaccuracy  in  measuring  the  intensity. 
The  measurement  error  as  defined  here  includes  components  such  as  the 
counting  error,  and  errors  in  the  preparation  of  samples  and  pellets  from 
the  same  calibration  mixture,  d^  is  not  considered  to  be  a  random  error 
but  rather  a  constant  value  (plus  or  minus)  which  describes  the  affect  on  the 
intensity  of  particle  size  2  over  and  above  particle  size  1.  It  is  assumed 
for  our  purposes  that  the  particle  size  within  a  batch  is  reasonably 
homogeneous.  Otherwise  one  might  consider  dj  to  be  a  mean  or  average 
particle  size  affect.  It  must  also  be  emphasized  here  that  the  dj  represents 
an  affect  on  intensity  of  ingredient  1  of  the  overall  particle  size  of  the 
mixture  and  not  merely  the  particle  size  of  any  one  ingredient.  Finally, 
for  our  purposes,  it  is  assumed  that  d^,  d2»  and  dj  (,vf  the  case  of  a  three 
component  system)  are  independent  of  the  concentration  level  (V^,  ,  Vj). 

This  does  not  appear  to  be  an  unreasonable  assumption  if  the  concentration 
spread  of  interest  is  not  excessive. 

We  shall  now  proceed  to  incorporate  the  model  of  equation  (3)  with  that 
of  (2.  a)  into  a  procedure  for  estimating  the  coefficients  of  (2.  a),  Suppose 
we  have  concentration  -  intensity  experimental  data  for  which  the  basic 
materials  are  at  particle  size  level  2,  We  would  like  to  be  able  to  use  this 
data  to  estimate  (2.  a)  for  materials  at  any  particle  size  level,  Suppose  we 
consider  (2.  a)  in  which  the  materials  are  at  particle  size  level  1.  Substi¬ 
tuting  the  actual  intensities  at  particle  1  into  (2.  a)  yields: 

3  3 

(4)  V.  =  a.,  +  2  a.  (X.-d.-f,)  +  2  a..  (X.-d.-f.)  (X.-d.-f.)  +  «,. 

v  1  0  j=1  J  1  J  J  J  j=i  lJ  v  *  1  1  '  J  J  J  1 

We  have  added  the  usual  ej  (error  term)  as  a  random  term  to.  account  for 
inaccuracies  in  equation  (3)  since  this  equation  is  certainly  not  completely 
deterministic  in  its  derivation. 
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Estimation  of  Coefficients  in  Equation  (3).  Suppose  the  chemist  were 
to  nrpnarp  samples  at  nrAscInr-teH  concentration  levels  and  intensity 
readings  are  taken,  the  ingredients  being  from  a  batch  at  particle  size 
level  2.  We  wish  to  use  this  information  to  obtain  an  estimate  of  the 
concentration  -  intensity  relationship  for  particle  size  1  (or  for  the  ingre¬ 
dients  from  a  batch  at  any  other  particle  size). 

Equation  (4)  can  be  written  as: 

(5)  V,  =  a0  +  alX)  +  „2X2  ♦  a3X3  t  ^X2  +  a12XlX2  +  +  Z 


where; 


v-[ 


,s,  Vj  +  s,  Vi1  +  -  2auWfi' 

j=l  J  J  j=l  J  J 

+  2alldlfl  4  a12  *-dld2+V?J  4  al2  ^dlf24fld2"i2XrflX2 


“  d2XrdlX2^  4  ai3  tdid3+£if3+dif3+fid3‘f3Xi‘fiX3-d3Xi“diX3l 


Thus  the  "error"  associated  with  the  least  squares  model  of  equation 
(5)  is  given  by  equation  (6).  Note  the  terms  that  are  translated  to  Zj 
through  measurement  errors  and  through  the  important  particle  size  effects. 
The  X's  in  equation  (5)  are  the  measured  values  of  the  intensitites  and  thus 
are  random  variables.  One  notices  that  if  the  usual  least  squares  procedure 
is  used,  i.e.  ,  by  minimizing  the  sum  of  squares  of  the  errors  in  determin¬ 
ing  the  estimates  of  the  coefficients,  that  the  error,  Zj,  is  correlated  with 
the  X's,  since  both  involve  the  f's,  This,  of  course,  invalidates  the  usual 
regression  assumption  [9]  that  the  residual  error  and  the  X's  are  independ¬ 
ent.  Of  course,  the.  errors  in  measuring  the  intensities  may  well  be 
negligible,  in  which  case  we  need  only  consider  the  effects  translated  by 
particle  size.  We  shall  discuss  this  situation  in  a  later  part  of  the  paper. 

It  is  not  unreasonable  to  assume  that  these  errors  are  independently 
distributed  with  zero  mean  and  variance  <rf  .  Suppose  we  make  n  observa- 

tions  of  the  type  (X^. ,  X^.i  X^,.  V.).  If  we  sum  both  sides  of  equation  (5) 

over  these  n  values,  we  obtain: 
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23  V. ,  s  M.  +  a.  2  X  +  a,  S  X,,  +  a,  S  X.,  +  a.  .  S  X.,  4  a. ,  Z  X.  ,X, . 
.  ,  li  0  1  li  2  2i  3  3i  11  li  12  li  2i 

1=1 


+  “13  =  XliX31  +  =  ZH  ' 


All  terms  in  equation  (7)  are  known  except  Z  .  The  latter  contains 

■ample  quantities  which  certainly  are  unknown.  For  example,  if  we  were 
to  expand  SZ^,  such  terms  as  -a^Zf^,  a^Zfj^  ,  -2a^Zf^X^,  etc.  would 

appear,  and  since  we  have  no  knowledge  as  to  the  measurement  error  on 
any  given  sample,  these  quantities  are  unknown.  However,  we  can  replace 
these  quantities  by  parameters  that  represent  their  "expected"  or  "average" 
values,  the  latter  which  we  can  estimate  by  a  separate  experimental  proce¬ 
dure.  If  we  assume  the  measurement  error  variable  £.(j=l,2,3)  has 
mean  0  and  variance  <r“  ,  then  J 

j 

E[  Z  fj  =  E[Zf,J  =  E[2f,.]  =  0  . 


Here  the  "E"  notation  refers  to  expectation.  For  a^Zf^  ,  we  can  write 

KiJ  ■  <rf2 


and,  if  we  further  assume  that  the  measurement  errors  are  independent, 

Ef  a  Zf  2 1  =  net  tr  2 

■  11  li  J  ll  fj 

After  performing  these  operations,  we  can  then  write 

ZVU  =  n&0  +  ai(Sxu-nd1)  4  x^-ndj  4  a3(S  x3i'nd3) 

*  ‘u<*4twirMl2i*l|-"'f1>  +  ai2<SV‘2l+ndld2 

(8)  .  1 


-d2Zxii-diZx2i)  4  ai3(Zxlix3i-d3Zxli4nd1d3-d1Sx3i). 
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Equation  (8)  is  unbiased  in  the  sense  that  both  sides  have  the  same 
expectation.  We  have  inserted  "hats"  on  the  a  terms  to  imply  that 
they  will  be  estimated  by  equations  of  this  type. 

For  the  next  estimating  equation,  we  can  multiply  both  sides  of  (5) 
by  Xj^  and  sum  over  the  n  observations  as  before. 

LVliXli  =  a0Sxli  +  alZxli  +  a2Sxlix2i  +  °3Zxlix3i  +allSxli 
+  a12Zxli  x2i  +  a13Sxli  x3i  +  SxliZli  ' 

Zxli^li  conta^-n  unknown  sample  quantities  which  we  shall  once  again 

replace  by  their  expectation.  The  term  i2  f  3  is  replaced  by  ntr,  3, 

3 

which  we  are  defining  as  i  the  third  moment  of  the  distribution  of  f^. 

In  doing  this,  we  arrive  at  the  following  equation; 

..  ^  ..  .A  2  _2  ,  H  l  .  A  la.  i  an  v 


SxliVli  =  a0Zxli  +  ai  (Sxii-n<r£i  “diSxii)  +  a2  (Zxlix2i“d2Zxli) 

+  a3(Ex1.x3i-d3Zxu)  +  ail(2xu+2nd1cr^-no‘^ 

“  3°‘f1Z!xli+dlZxli"2dlSxli^  +  ai2(ZxliX2i+nd2i<rf1 

■^Sx2i+dld2Sxli-d2Sxl2i-dl£xliX2i)  +  *13^4^ 

+  nd3<r^-(rfiSx3i+d1d3Zx1i'd3Zxli-diZxlix3i)  • 

This  equation  also  has  unbiased  property  as  does  equation  (8). 

At  this  stage  we  have  two  estimating  equations.  We  can  proceed  to 
derive  five  more  for  estimating  the  seven  coefficients  in  model  (2.  a). 

We  obtain  these  equations  by  multiplying  both  sides  of  equation  (5)  by 
x2i'  X3i*  XU  ’  xlix2i’  and  xlix3i  and  Per^orm^n8  the  necessary  operations, 
as  described  here  for  the  first  two  equations,  on  Jx,.Z  ,  etc. 

it  1  11  J 1  ll 
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2  2 

Estimation  of  the  d'r,  cr^  ,  etc.  The  quantities  d,  and  cr^  (j=l,  2,  3) 

j  J  j 

which  appear  iri  the  estimating  equations  are,  of  course,  unknown  and 
must  ho  ontl mafrod  hof^ro  we  can  use  the  equations  in  estimating  the  a'e. 

2 

Obtaining  an  estimate  of  <r^  is  quite  easily  accomplished  by  preparing 
several  camples  and  obtaining  intensity  measurements  x^,  •••» 

(independent  of  the  samples  used  in  section  (b)  )  at  some  concentration  and 

2  -  2  " 
computing  =  E(x^-Xj)  /N,  the  sample  variance.  One  can  then  com¬ 
pute  estimates  for  <rj^,  i by  obtaining  similar  sample  variances  for 

the  intensities  of  components  2  and  3.  '  We  can,  of  course,  estimate  the 
third  and  fourth  moments  in  a  similar  manner. 


To  obtain  estimates  of  the  d's,  the  experimenter  needs  first  to  obtain 
replicated  analyses  (on  component  1  for  the  case  of  d  )  for  a  sample  of  raw 

materials  from  particle  size  2.  One  must  then  obtain  similar  readings  for 
the  materials  from  the  batch  of  interest,  in  our  case  this  refers  to  the 
batch  at  particle  size  1.  It  is  important  that  the  two  sets  of  readings  be 

taken  at  the  same  concentration,  One  can  then  obtain  the  averages 

-(21  1 
and  x,  ,  where  the  superscript  denotes  the  particle  size  condition.  The 

1  -(2)  -(1) 

unbiased  estimate  of  is  then  '  -x  ^  .  The  same  procedure  is  used 

to  obtain  estimates  of  d  ^  and  dy 

APPLICATION  TO  IGNITER  MIXTURES.  "  Samples  of  the  igniter 
mixture  were  prepared  at  various  concentrations  of  KCIO^,  Ti,  and  Al. 

The  intensity  for  each  component  was  measured  for  each  sample,  The  data 
is  shown  in  Table  1.  The  overall  particle  size  effect  on  each  intensity  was 
assumed  to  be  the  same  for  these  samples,  and  the  materials  in  this  batch 
were  relatively  "coarse"  for  all  three  ingredients.  Thus  we  shall  refer  to 
the  particle  size  as  c-c-c.  This  is  particle  size  2  in  our  theoretical 
development, 

Experimental. 


Instrumentation- Analyses  were  made  with  a  universal  vacuum  X-Ray 
spectrometer  marketed  by  Philips  Electronic  Instruments.  Spectrometer 
components  consisted  of  an  FA-60  tungsten  target  X-Ray  tube,  a  4-inch 
by  0.020-inch  entrance  collimator,  an  ethylenediamine  D-tartrate  (EDDT) 
analyzing  crystal,  and  a  gas  flow  detector  flushed  with  P-10  gas.  The  X-Ray 
tube  was  operated  at  45KV  -  constant  potential,  and  40  ma.  Pulse  height 
discrimination  was  used  for  the  analysis  of  aluminum. 
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TABLE  1 


j  Concentration-X-Kay  intensity  Data  for  Igniter  Mixtures 

j  *  ; 

£ 

a  *  Potassium  Perchlorate  Titanium  Aluminum  j 


Procedure -Calibration  and  "check"  mixtures  were  prepared  for 
analysis  as  follows;  10  g.  of  each  mixture  including  a  variable  amount 
of  a  cellulose  binder  was  weighed  into  a  1-inch  by  2 -inch  stainless  steel 
vial  and  a  3/8  inch  diameter  plexiglas  bail  was  added  to  the  mixture.  The 
mixture  was  then  blended  on  a  pica  blender  mill  for  10  minutes.  The  ball 
facilitated  blending  without  reducing  the  particle  sizes  of  the  powders. 

Two  5  g.  pellets  of  each  mixture  were  made  in  a  1  l/4-inch  diameter 
pellet  die  under  a  pressure  of  30,000  pei.  The  surface  of  each  pellet  that 
was  against  the  die  plunger  was  subsequently  analyzed. 

Pellet  Samples  were  completely  randomized  among  the  calibration 
mixtures  and  analyzed  in  pairs  in  conjunction  with  a  stable  reference  pellet 
containing  the  same  analytical  elements  as  the  mixtures.  The  reference 
standard  was  used  to  correct  X-Ray  intensities  for  short  and  long  term 
instrumental  fluctuations.  Peak  intensity  measurements  were  made  by  a 
fixed  count  technique  and  recorded  as  corrected  counts  per  second.  Specific 
analytical  parameters  are  given  in  Table  2. 

TABLE  2 

Analytical  Parameters  for  the  Analysis  of  Igniter  Mixtures 

Component  Emission  line  ^Angle^ZQ  Fixed  Counts 

Potassium  Perchlorate  KKa  22,23  200,000 

Titanium  TiKall  49.25  100,000 

Aluminum  AlKa  114.77  50,000 

’■'EDDT  crystal  advanced  approximately  30  2  6 


ESTIMATION  OF  CONCENTRATION-INTENSITY  MODEL  AT  SECOND 
PARTICLE  SIZE  CONDITION.  A  second  batch  of  material  was  considered, 
one  which  contained  relatively  coarse  particles  of  KCIO4  and  fine  particles 
of  Ti  and  Al.  Suppose  one  wished  to  estimate  equation  (2.  a)  for  the 
c -f-f(particle  size  1)  lot  using,  however,  the  available  concentration- 
intensity  data  for  particle  size  2  namely  that  in  Table  1, 

For  the  purposes  of  estimating  the  d^,  a  sample  from  the  c-f-f  batch 
was  prepared  at  19,  18,  and  21  per  cent  KCIO4,  Ti,  and  Al  respectively, 
Duplicates  were  taken  and  the  intensities  in  counts  per  second  obtained 
were;  • 
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Ti 

3.  676. 
3,  646. 


AI 

2,453. 

2,461. 


KC 10 

4 

5,787. 

5,770. 


Point  9  of  Table  1,  with  ingredients  also  at  19,  18,  and  21  per  cent 
concentration  was  used  as  the  appropriate  sample  for  the  c-c-c  batch. 
Subtraction  of  the  average  intensities  was  performed  as  indicated  previ¬ 
ously.  Solution  of  equations  (8)  through  (14),  using  the  data  of  Table  1 
was  then  accomplished  on  an  IBM  7040  computer  for  each  of  the  three 
ingredients.  These  coefficients  are  listed  in  Table  3,  The  coefficients 
can  now  be  used  for  analysis  of  mixtures  for  the  materials  from  the 
c-f-f  lot. 


TABLE  3 


Estimates  of  the  Coefficients  for  Coarse-Fine-Fine  Lot 


KC104(component  1) 

aQ  =  1.80189 
Sj  =  2.28358xl0“3 
&2  =-3. 71806xl0*7 

a3  =  5. 22015xl0-4 
a  =  4.99141xl0"9 
ft  =-4.46347x10“® 
ai3  =  1.47985xl0“7 


Ti  (component  2) 

=  -7.99781 
a  =-4.34502x10“® 
a2  =  3. 68540x1 0” 3 
a3  =  1 . 51577xl0“4 

a22  =  -7.89669x10"® 
ai2  =  4,91218x10“® 
a23  =  1 . 96862x1 0"7 


Al(component  3) 

=  -4.  12577 
at  =  -2.  65274xlo“5 
a2  =  2.87296x10“® 

&3  =  1.43834xl0“2 

a33  =  -6.  38742xl0“7 
an  =  - 1 . 48698x1 0“7 
a23  =  -2.  88265x10“ 7 


Analysis  of  Check  Sample.  More  samples  were  prepared  using 
materials  from  the  c-f-f  lot  in  order  that  the  analytical  equation  for  KC10^ 
and  Ti  could  be  checked.  Notice  that  it  is  only  necessary  in  this  case  to 
analyze  for  two  components.  The  third  can  be  obtained  by  difference 
because  the  a-cellulose  binder  is  added  by  the  analyst  and  is  always  known. 
The  per  cent  of  Al  for  the  "check  samples"  was  computed  by  difference. 
The  results  were  compared  with  the  known  concentration#  in  order  that 
the  quality  of  the  estimating  equations  could  be  evaluated.  In  order  to 
illustrate  the  improvement  obtained  by  the  method  over  that  of  ordinary 
least  squares  without  the  particle  size  correction,  the  results  for  the 
check  samples  were  compared  with  those  obtained  by  estimating  the  inten¬ 
sity-concentration  relationship  of  equation  (2,  a)  by  ordinary  least  squares, 


261 


The  first  sample  contained  the  known  concentration;  25%  KC10  , 
25%  Ti,  25%  Al,  and  25%  a-cel).ulose  binder.  The  intcasities  in  counts 
per  second  were  observed  in  duplicate.  The  results  are; 


KC10 

8,453. 

8,332. 


Ti 

ft,  107. 
8,129. 


Al 

3,353. 
3, 379. 


Using  these  intensities  from  the  duplicates,  the  average  calculated  per¬ 
centage  compositions  (Using  the  coefficients  in  Table  2)  are  below: 


KC101  Ti  Al  * 

24.71  25.59  24.70 


This  indicates  the  agreement  between  actual  and  estimated  concentrations. 
One  would,  of  course,  expect  even  better  agreement  if  the  range  of  con¬ 
centration  of  the  original  data  in  Table  1  were  more  narrow.  The  estimated 
concentration,  using  conventional  least  squares,  neglecting  particle  size 
and  measurement  errors  are: 

KC 10 1  Ti  Al 

20.17  30.40  27.20 


The  difference  between  these  values  and  the  ones  for  our  proposed  proce¬ 
dure  is  primarily  due  to  the  introduction  of  the  d's  into  the  method, 

Additional  samples  from  the  c-f-f  lot  were  prepared  and  the  estimates 
of  concentration  were  obtained,  using  both  conventional  least  squares,  and 
our  procedure,  The  results  are  shown  in  Table  4. 

TABLE  4 


l 

Actual  Concentrations  Predicted  Concentrations  Least  Squares 


Sample 

%K 

%Ti 

%A1 

%K 

%Ti 

%A1 

%K 

%Ti 

%Al 

1 

21 

21 

21 

20.  8 

21. 65 

21,  55 

16,  88 

26.  56 

19,  56 

2 

21 

25 

19 

20.  53 

26.  3 

18.  17 

16.  72 

30.  b 

17.  68 

3 

18 

20 

24 

18.  09 

20.  4 

23.  5 

17.  33 

25.23 

19.45 

Note  the  improvement  in  the  procedure  over  the  least  squares  results, 
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Use  of  a  Linear  Model.  In  many  ease*  of  quality  control  analysis  the 
materials  to  be  analyzed  will  vary  over  small  concentration  ranges  and 
the  procedure  ol  estimating  concentrations  at  a  given  particle  size  and 
compensating  for  recognized  particle  size  changes  can  be  simplified  by 
using  a  linear  model  such  as: 


*1  -  b0  +  blXl  ♦  b2X2  +  Vs  + 


The  same  procedures  apply  to  this  model  and  the  estimating  equations 
are  considerably  more  simple  than  those  for  the  second  order  model 
discussed  in  detail  here. 


Discussion  of  Sources  of  Error.  The  dj  and  the  moments  of  the  f's 
are  based  only  on  sample  estimates.  This  is  obviously  a  source  of  error 
in  the  procedure.  For  the  igniter  system  presented  here,  the  dj  are  based 
on  only  two  observations.  We  would  expect  better  results  on  the  check 
samples  if  we  had  used  more  observations. 


In  many  practical  situations  where  the  X-Ray  fluorescence  technique 
is  used,  the  range  of  interest  in  concentration  would  be  more  narrow  than 
what  we  used  here  (Table  2).  In  practice  one  might  wish  to  narrow  the 
range  of  experimentation  to  insure  the  truth  of  the  assumption  that  the  dj 
are  truly  constant  and  do  not  depend  on  concentration, 


When  determining  one  must  be  sure  to  include  all  source  of  error 

which  cause  Xj  to  differ  from  the  true  intensity  Ij.  As  pointed  out 
earlier  this  involves  more  than  just  making  repeated  measurements  on  the 
same  sample  which  gives  primarily  the  counting  error.  The  error  of 
blending  mixtures  and  preparing  pellets  as  well  as  uncompensated  instru¬ 
mental  mechanical,  and  electronic  variations  must  also  be  accounted  for. 

A  good  estimate  of  the  measurement  error  can  be  easily  obtained,  however. 


The  composition  selected  for  determining  both  dj  and 


should  lie 


close  to  the  center  of  the  calibration  compositions.  Also,  the  calibration 
compositions  should  be  selected  according  to  a  statistically  designed 
experiment  to  insure  accurate  estimates  of  the  coefficients  in  equation  (2.  a). 


In  a  controlled  process  the  normal  variation  of  particle  sizes  among 
lots  of  materials  will  be  smaller  than  the  variation  shown  here  for  sizes 
1  and  2.  These  variations  were  purposely  made  large  to  illustrate  the 
suitability  of  the  method. 
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The  a -cellulose  binder  of  the  igniter  mixtures  was  considered  as  a 
variable  component  in  this  work.  Although  it  could  not  be  analyzed 
directly  by  X-Ray  spectrometry;  the  binder  was  allowed  to  vary  to  simulate 
production  igniter  mixtures  which  may  contain  a  binder  subject  to  pro¬ 
duction  variations  in  the  same  manner  as  the  other  components.  The 
binder,  of  course,  also  results  in  the  formation  of  stronger  pellets,  and 
thereby  allows  a  wider  range  of  composition  to  be  analyzed.  The  binder 
would  normally  be  added  to  the  mixture  in  a  constant  amount  by  the  analyst. 
Results  of  analyses  with  constant  binder  would  probably  be  more  accurate 
than  results  with  variable  binder. 
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DETERMINING  THE  CONFIDENCE  LIMITS  FOR  SOME  TIME 
INDEPENDENT  SYSTEM  RELIABILITY  ESTIMATES  WHEN 
ATTRIBUTE  DATA  FOR  THE  INDEPENDENT  SUB-COMPONENTS 
ARE  GIVEN.  (A  Proposed  Solution  and  Approximating  Formula) 

Eugene  Dutoit 

Picatinny  Arsenal,  Dover,  New  Jersey 

STATEMENT  OF  PROBLEM:  A  problem  that  arises  often  in 
ammunition  engineering  is  estimating  the  reliability  of  some  "one  shot" 
weapon  systems.  This  clinical  problem  is  concerned  with  the  situation 
where  the  only  data  available  are  attribute  (the  fraction:  number  of 
successful  functionings/total  number  of  items  tested)  and  pertain  to  the 
components  of  the  system.  The  ammunition  or  reliability  engineer 
arranges  the  independent  system  components  in  some  logical  configura¬ 
tion  (called  the  reliability  block  diagram)  and  he  constructs  a  mathematical 
model  of  the  overall  system  reliability.  Established  procedures  do  exist 
for  determining  the  reliability  of  each  separate  component  at  any  appro¬ 
priate  confidence  level,  but  this  problem  of  interest  is  to  establish  some 
techniques  for  combining  these  component  data  so  that  some  reliability 
estimate  can  be  made  about  the  system  (note;  -no  "system"  data  are 
available)  at  any  desired  confidence  level.  In  essence,  this  problem  is 
hopefully  designed  to: 

(1)  Raise  some  interest  and  thought  for  this  problem  which  appears 
to  have  been  treated  too  lightly  considering  the  frequency  with  which  it 
arises.  Perhaps  someone  who  might  be  writing  or  considering  to  write 
a  textbook  on  reliability  might  develop  a  computational  procedure  for 
publication  and  reference.  The  use  of  computer/simulation  studies  have 
already  been  proposed.  These  methods  may  be  applicable  when  a  computer 
is  available  and  time  is  not  a  crucial  factor,  but  we  are  seeking  a  solution 
that  would  give  a  quick  but  good  approximation  to  some  rigorous  and 
lengthy  solution. 

(2)  Encourage  the  examination  of  data  indicating  the  distribution  of 
failures  for  conventional  weapon  systems  to  determine  if  some  character¬ 
istic  distribution  can  be  used  to  describe  some  types  of  items.  This 
paragraph  reflects  similar  statements  made  by  Lt.  Colonel  M.  S.  Hochmuth 
during  the  "opening  remarks"  of  this  conference. 

AC  KNOW  LEDGMENTS:  Before  continuing  with  some  proposed  solution 
and  approximating  technique  I  would  like  to  express  my  appreciation  to  the 
Army  Mathematics  Steering  Committee  for  giving  me  the  opportunity  to 
present  this  clinical  problem  at  the  "Twelfth  Conference",  I  am  also  appre¬ 
ciative  to  all  the  panel  members  (Dr.  F.  Frishman,  Chairman;  Mr.  O.  Bruno, 


Professor  A.  Cohen,  Jr.  ,  Professor  B.  Harahbarger,  Dr,  J.  Rosenblatt 
and  Professor  H.  Solomon)  who  offered  constructive  suggestions/comments 
either  at  the  meeting  or  by  writing. 

I  also  wish  to  thank  Mr.  Stuart  Ritter  who  developed  the  computer 
program  and  charts  used  in  this  work. 


A  PROPOSED  SOLUTION:  The  author  of  this  report  has  independently 
arrived  at  a  "similar"  solution  to  the  problem  as  Mr.  H.  DeCicco  [  1  ]*  and 
Messrs.  Lloyd  And  Lipow  [4]  ,  therefore  the  derivations  presented  here 
shall  be  "quick  and  dirty".  The  interested  reader  should  consult  these 
references,  and  the  other  sources  cited  in  this  report,  in  order  to  become 
more  familiar  with  the  problem.  Those  readers  who  are  Interested  in 
researching  the  problem  might  compare  this  enclosed  solution  or  some 
other  possible  solutions  with  each  other  to  determine  if  some  extra  degree 
of  accuracy  obtained  by  a  more  rigorous/analytic  method  is  worth  the 
extra  effort.  DeCicco  mentions  in  his  paper  [1]  that  it  is  "unrealistic  to 
expect  serious  support  for  assurance  to  more  than  two  significant  digits". 
This  criterion  might  be  used  to  determine  significant  differences  between 
all  proposed  solutions  to  this  problem.  This  proposed  solution  will  be 
reduced  to  some  approximation  and  graphic  procedure  which  will  hopefully 
simplify  the  computation  for  non-mathematically  oriented  personnel. 


Notation:  Number  of  "y"  components  in  parallel  in  set  i,  where  the 
reliability  of  each  item  at  the  50%  confidence  level  is  r^. 

*  Numbers  in  brackets  indicate  references  at  the  end  of  paper. 
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The  derivation  of  the  "error  propagation"  formula  ia  well  known 
and  need  not  be  discussed  here  (see  Bowker  and  Lieberman;  Engineering 
Statistics,  Prentice  Hall,  Inc.,  Englewood  Cliffs,  N.  J.  ,  1959,  page  62). 
Given  a  function  of  m  variables  f(r^,  r^,  •  •  •  ,  *m)  with  expected  values 

r. ,  J . $  ,  the  expected  value  of  the  function  is  approximated  by: 

12  m 

E[f(^,  r g,  •  •  •  ,  r  )]  =  f(*ji  ^2’  *m)  with  approximate  variance; 


VAR  r2,  ....  rm)] 


VAR  (r^ 


•  •  •  +  VAR  (r  ) 
m 


Considering  the  general  series-parallel  configuration,  the  equation 
for  the  reliability  of  this  system  is: 


Equation  (l)  corresponds  to  the  expected  value  of  the  function.  The 
variance  of  equation  (l)  is 


Consider  that  r^  =  number  of  successful  functionings/total  number 
fired  or  tested,  where  r.  is  a  best  estimate  of  a  proportion  describing 
a  population  where  a  proportion  r^  of  the  individuals  have  a  certain 
characteristic  and  a  proportion  1-r^  of  the  individuals  do  not  have  it. 

If  is  the  best  estimate  of  some  binomial  parameter  r^,  then  the 
variance  of  r^  is: 
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(3) 
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Where  n  »total  number  tested  of  item  i.  The  general  term  for  equation 
(2)  is: 


(4) 
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The  total  variance  of  the  system  reliability  estimate  is; 

m 
S 
i=l 
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Equation  (l)  describes  the  nominal  value  of  the  true  system  reliability 
R,  namely  R  and  equations  (4)  and  (5)  give  the  variance  of  the  system 
estimates.  In  the  area  of  convential  ammunation  reliability,  we  are 
interested  in  computing  the  lower  90%  confidence  limit  of  R.  This  is  done 
in  the  usual  way: 

(6)  90%  C.L.  R  =  R  >  A 


where  "A"  depends  on  the  distribution  of  R. 

Since  we  have  no  data  for  the  overall  system  performance  (reference 
second  part  of  Statement  of  Problem  in  this  report)  it  was  decided  to  use 
distribution-free  methods  -  see  reference  [1]  .  Chebyshev's  inequality 
states  that  the  amount  of  area  under  any  distribution  which  is  farthe^r 
away  from  the  mean  than  "A"  standard  deviation  units  is  less  than  — jp  • 
This  is  described  in  figure  (2)  below:  *  A 
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"A"  ia  determined  so  that  at  least  90%  of  the  distribution  is  explained; 
i.e.  ,  the  shaded  area  must  be  no  larger  than  10%.  Applying  the  above 
theorem: 


A  *  3.  16 


therefore  equation  (6)  becomes: 

(7)  90%  CL.  R  >  k  -  3.  16 


SERIES  SYSTEMS:  The  most  common  case  of  conventional  ammunition 
reliability  assessments  have  been  on  systems  without  replicated  compo¬ 
nents.  Referencing  figure  (l)  and  letting  a*bs.  .  .  =k=l  we  have  the  following 
condition: 


Figure  (2) 


Equation  (1)  becomes; 

m 

(8)  *' V  V"  '»  *  ",  ' 


Equation  (4)  becomes: 


So  that  equation  (5)  is: 


=  My 

isl 


'  }i-l  ?i+l 


*i  (>■*.) 


The  values  obtained  by  equations  (8)  and  (10)  are  then  "stuffed  into" 
equation  (7)  to  obtain  the  lower  90%  confidence  limit  on  the  system 
reliability. 
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Example;  Consider  two  (2)  elements  in  series: 


*  2 

Figure  (3)  on  the  next  page  gives  the  estimates  of  R  and  o>  for  2  through 
5  components  connected  in  series, 


PARALLEL  SYSTEMS;  If  S  components  are  arranged  in  a  parallel 
configuration,  each  component  with  reliability  i  and  all  S  components 
must  fail  for  the  system  to  fail,  then  by  applying  equations  (l),  (5)  and 


R 
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then 


°*ft 


S(l -x)8"1 


i 


therefore; 

1 

(11)  90%  CL.  R  =  1  -(1  -r)S-(3.  16)(S)(l -r)8"1  J 


APPROXIMATION  FORMULAE.  The  aforementioned  equations  can 
be  cumbersome  to  work  with  and  (a s  mentioned  earlier  In  the  paper)  it 
might  be  useful  if  some  approximation  technique  could  be  used  in  its  place. 


Series  Case.  Consider  the  estimate  of  the  system  reliability  ft 


Where  - 


(8) 


R  = 


m 

n 

i=l 


tk 

Suppose  we  were  to  "assume"  that  R  was  on  estimate  of  some  binomial 

a  2  2 

parameter  R.  The  estimated  variance  [e  * ]  of  ir^  would  be* 


(12) 


R£Kk) 

n 


The  value  of  n  would  be  chosen  so  that  r*  would  be  a  conservative 
maximum.  ^ 


If  the  sample  sizes  n  are  the  same,  then  this  common  sample  size 

should  be  used  in  the  denominator!  if  n.  /  n.,  then  the  minimum  value 

l  r  i 

of  n  should  be  used  in  order  to  maximize  fr-  . 
re-written  as:  ^ 

(12)' 


Equation  (12)  can  be 


ft 


ki  -  ft) 

n(min) 


Equation  (7)  now  becomes; 


(13) 


90%  C.L.  R  >  R  -  3.  16  ? 


ft 
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The  difference  between  the  proposed  solution  (equation  7)  and  its 
approximation  (equation  13)  was  examined  in  a  general  fashion.  From 
this  cursory  ir.vs ctigaticr. ,  the  difference a  between  these  methods  can 
range  from  about  0.  00  (two  significant  decimal  places  -  see  DeCicco) 
to  roughly  0.05  at  extreme  conditions.  It  appears  that  this  approximation 
becomes  more  effective  for  r^  -*  1.  0  or  large  values  of  n^  or  both. 

Parallel  Case;  The  estimate  of  the  system  variance  (equation  12) 
could  be  applied  to  parallel  system  configurations.  The  range  of  differ¬ 
ences  has  not  been  investigated  but  it  is  expected  to  be  in  close  agreement 
with  the  series  situation. 


GRAPHIC  PROCEDURE:  As  stated  earlier,  it  would  be  useful  to 
reduce  the  computations  of  both  approaches  to  the  problem. 

Proposed  Solution  -  Series  Case  -  Equation  (4)  can  be  substituted 
into  equation  (2)  to  express  the  total  variance  of  the  system  reliability 
estimate  (expanded  form  of  equation  5): 


(14) 
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m 


In  the  series  case  a=x=y=z=k=.  .  .si,  and  by  factoring  out  II  r. 


we  obtain: 
(15) 
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,  m  r  •  r  •  •  •  r.  r.  •••r  (1-r.) 
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By  letting 
(16) 
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Equation  (15)  becomes 

(17) 
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25  <  «i  <  200 


n  qo 


2  <  <  5 


From  these  initial  boundary  intervals  the  range  of  values  for  and 
S|^  were  determined  to  be; 

.  005  <  $  <  .  150 

50xl0"6  <  Z§  <  30,000  xl0"6  . 


Figures  4  and  5  have  been  worked  out  per  the  above  ranges  of  values  and 
are  presented  in  the  appendix  as  Figures  4'a,b  and  5'a,b. 

Approximate  Solution  -  Series  Case;  Equation  (13)  is: 


90%  C.  L.  R  >  R  -  3.16 


Skill* 

n(Min) 


which  is  defined  by  R  and  n(Min).  These  parameters  will  be  assumed  to 
have  the  following  range  of  values- 

.85  <  R  <  .99 

25  <  n(min)  <  200 

-  so  that  the  following  graph  can  be  determined: 


90%  C.  L.  R  > 


n(Min) 


Figure  (6) 

The  details  for  Figure  (6)  are  given  in  the  appendix  as  Figure  6' 


i 


This  method  is  certainly  much  easier  to  use  than  any  of  the  previous 


Proposed  Solution  -  parallel  case  -  Consider  equation  (11) 


90%  C.  L.  R-l-(l-r)"  -  (3.  16)  (S)  (l-r)*"1  j  *  which  is  defined 

by  s  number  of  components  in  parallel,  ^reliability  of  each  component 
in  the  parallel  network  and  n=the  same  sise  used  to  compute  r.  For 
practical  purposes  let  «>2  and  3  components  in  parallel. 

The  following  graphs  can  be  constructed: 


which  is  defined 


90%  C .  L>.  R> 


Figure  (7) 

The  details  are  given  in  the  appendix  as  figure  7'a,b. 

CONCLUDING  REMARKS;  The  above  procedure  is  a  proposed  "type 


of  answer  examplifying  the  kind  of  solution  requested.  Any  solutions  to 
this  problem  that  can  be  published/circulated  as  a  standard  reference 
would  be  appreciated. 
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STATISTICS,  PROBABILITY,  AND  DETERMINISM  IN  A 
RELIABILITY  IMPROVEMENT  PROGRAM 


Woodie  R.  Jenkins,  Jr. 

National  Range  Operation* 

White  Sanda  Missile  Range,  New  Mexico 

The  Data  Collection  Directorate  of  White  Sands  Missile  Range  (WSMR) 
is  presently  engaged  in  the  task  of  increasing  the  probability  of  obtaining 
usable  data  from  several  data  gathering  systems.  These  systems  are  used 
on  various  projects  to  collect  vehicle  performance  data.  The  projects  are 
tests  of  weapon  systems.  The  data  gathering  systems  are  optical  cameras 
and  electronic  instruments  used  to  measure  the  position,  velocity,  attitude, 
events,  and  internal  status  of  test  vehicles.  The  probability  of  obtaining 
usable  data  is  the  "Reliability"  that  is  referred  to  in  this  paper.  Data 
records  are  obtained  by  instruments  of  the  optical  and  electronic  systems, 
and  the  records  are  assessed  "Usable  in  Reduction"  or  "Unusable  in 
Reduction"  by  the  WSMR  Data  Reduction  personnel. 

It  is  the  policy  of  the  Data  Collection  Directorate  to  allow  a  data  gather¬ 
ing  system  to  exhibit  a  total  fraction  of  unusable  records,  over  a  given  time 
period,  that  does  not  exceed  PQ.  In  other  words,  if  U  =  the  number  of 
unusable  records  and  I  =  the  number  of  attempts  to  obtain  data,  then  the 
fraction  of  unusable  data  obtained  by  a  system  over  a  given  period  of  time  is 


[Note  that  U/l  is  a  measure  of  the  unreliability  of  the  system,  and  one  minus 
the  unreliability  is  the  reliability  of  the  system.  ]  And,  in  order  for  the 
process  of  obtaining  usable  data  to  perform  in  an  acceptable  manner, 


P  Must  be  <  P  . 

—  o 

When,  over  a  specified  period  of  time,  P  >  PQ,  then  the  Directorate  must 
take  action  to  improve  its  data  gathering  reliability. 

It  is  the  P  >  Pq  problem  that  we  address  ourselves  to  in  this  paper. 

The  question  to  be  answered  is  "What  action  must  the  Directorate  take 
in  order  to  ensure  that  P  will  be  <  PQ  for  the  next  equal  sampling  period?" 
It  is  my  hypothesis  that  "The  areas  that  should  be  controlled  can  be  found 
by  determining  the  most  significant  ditferences  between  the  deterministic 
relationships  that  existed  at  the  time  the  unusable  records  were  obtained  by 


gsm 


l  I 


■- 


specific  instruments  and  the  relationships  that  existed  at  the  time  the 
usable  records  were  obtained.  This  requires  that  the  same  instruments 
at  the  same  locations  be  operated  by  the  same  personnel  on  the  same 
projects  in  both  cases.  Moreover,  hypotheses  about  how  to  control 
physically  the  appropriate  deterministic  relationships  can  be  formulated, 
tested,  and  verified  with  satisfactory  results.  " 

[if  other  hypotheses  are  made  available,  1  will  certainly  consider  them.  ] 

Once  the  relationships  or  parameters  that  must  be  controlled  are  known, 
then  statement 2)  of  my  hypothesis  can  be  performed. 

The  following  example  illustrates  how  statement  (1)  of  my  hypothesis 
can  be  accomplished, 

Let  us  say  that  we  must  assure  ourselves  that  the  P  >  PQ  condition 
for  sample  (l)  will  be  a  P  <  PQ  condition  for  sample  (2)  for  the  tracking 
camera  system  (cinetheodolites).  Sample  (1)  is  the  original  data  for  which 
P  >  P0.  Sample  (2)  is  the  necessary  and  sufficient  amount  of  data  needed 


to  make  a  decision  about  whether  the  controlled  process  yields  P  < 

The  following  observation  was  obtained  from  all  of  the  sample  (l)  data. 


o' 


Frequency 

of 

Occurrence 


‘  ! 


(1)  (2)  (3)  (4)  (5)  (6)  (7)  (8)  (9)  (10) 

Reasons  for  Being  Unusable 

Figure  I 


From  the  definition  of  P  [Eq.  ( 1)]  it  can  be  seen  that 

(2) 


P  = 


E  Frequencies 
Reasons 


If  there  is  no  reason  to  expect  that  the  P  for  sample  (2)  will  be ‘significantly 
different  from  the  P  for  sample  (1)  if  the  process  were  left  unchanged  and  if 


(3) 


E  Frequencies  -  Frequency  of  Reason  (l) 
Reasons 
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<  P  . 
—  o 


then  each  occurrence  of  reason  (l)  should  be  analyzed  for  the  deterministic 
conditions  or  relationships  that  existed  at  the  time  that  thd  data  records 
were  obtained. 


If  reason  (l)  is  identified  as;  "Insufficient  Coverage",  then  the  equation 
describing  the  probability  of  obtaining  "Sufficient  Coverage"  by  a  camera  is 
derived  as  follows. 


Sufficient  coverage  is  defined  as  the  required  number  of  consecutive 
frames  of  data,  M0,  for  any  optical  system.  If  a  clnetheodolite  is  assigned 
to  operate  on  a  project  from  time  t^  to  time  t'  at  a  data  gathering  rate 

of  r  frames  per  unit  tims,  then  the  expected  total  number  of  frames  of 
data  Is 


(6) 


Mt '  - 1 ')  . 

*  n  a' 


If  r  ®  the  obtained  frame  rate  and  [t  ,  t  ]  is  the  time  interval  over  which 

&  IX 

the  camera  operated,  then  the  total  number  of  frames  of  data  obtained  is 


(7) 


(t  -  t  ) 
1  n  a' 


Also,  if  eQ  t  and  t  a  azimuth  and  elevation  angles  respectively  of 

the  optical  axis  of  the  camera  at  time  t,  if  6t  and  $t  =  the  azimuth  and 

elevation  angles  respectively  of  the  aerial  target  to  be  tracked  at  time  t,  and 
if  j3g  and  |3^  are  the  angular  sized  of  the  camera1  field  of  view  in  the 

horizontal  and  vertical  planes  of  the  camera  respectively,  then  it  can  be 
shown  that  the  aerial  target  ie  contained  in  the  camera's  field  of  view  ii  and 
only  if 


6 


o.t 


+ 


57.  3 
2 


P*  >  04 


57.  3 
2 


P 


0 


(») 


and 


o,t 


§ 


o,t 


are  satisfied  simuitaneously.  Note  that  since  0  and  J  are  in  degrees 
and  and  (3^  are  In  radians,  57.  3  coverts  (3Q  and  [3^  into  degrees. 

If  we  call  A.  the  probability  of  the  camera  acquiring  the  aerial  target  at  the 
instant  t  and  use  the  concept  of  Delta  Functions,  then 
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(9)  At 


K7  3 

e  .  +  >  e*  >  8  * 

O;  t  2  0  t  0,t 


*  *IL2  ft  ^  ^  *  Hi!  a 

^o,t  2  ^o.t  2 

0,  Otherwise 


Since  we  must  have  at  least  M0  number  of  consecutive  frames  of  Aj.  *  1 
over  [t  i  t  1  in  order  to  have  sufficient  coverage,  then  a  concise 

“■ '  ~W'"  ti 

mathematical  statement  of  the  required  condition  is  defined  as  follows. 

The  camera  operates  at  a  frame  rate  of  r  frames  per  second.  The 
time  required  to  obtain  one  frame  of  film  is  At,  where 


At*; 


Moreover,  if  it  takes  At  units  of  time  to  obtain  one  frame  of  film,  then 
it  takes  MQAt  units  of  time  to  obtain  MQ  consecutive  frames  of  film. 
Therefore,  sufficient  coverage  is  obtained  if  and  only  if 


t  -  M  At 
n  0 


t.  =  t 
1  a 


t.  +  M  At 
1  o 


A*  >  1  , 


t  *  tj  +  A  t 


where 


(12)  t.  =  {t  ,  t  +  M  At,  t  +  2M  At,  t  +  3M  At, . .  .  ,  t  -  2M  At,  t  -  M  A t}. 
v  '  1  1  a  a  o  a  o  a  o  n  o  noJ 


If  there  is  any  sampled  instant  in  M0  consecutively  sampled  instants  for 
which  =  0,  then  the  product  term  of  equation  (11)  is  zero  for  that  series 
of  frames.  If  all  such  series  of  frames  yield  product  terms  of  zero,  then 
the  film  record  will  surely  be  assessed  unusable  due  to  insufficient  coverage. 
Again  relying  on  the  Delta  Function  concept,  the  probability  of  having 
obtained  sufficient  coverage  ia  given  by 
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Equations  (9)  and  (13)  provide  a  means  for  attempting  to  find  a  physical 
cause  for  each  occurrence  of  unusable  records  due  to  insufficient  coverage. 
For  example,  the  following  relationships  can  be  compared  by  using  both 
usable  and  unusable  data  for  each  station  (camera)  that  obtained  unusable 
records  due  to  insufficient  coverage. 

After  analyzing  Figure  II,  we  will  be  in  a  position  to  formulate 
hypotheses  about  how  to  control  physically  the  relationshlp(s)  exhibiting 
the  most  significant  differences  between  the  usable  and  unusable  data  for 
a  given  camera  on  a  specific  project, 

The  above  discussion  has  illustrated  my  approach  to  solving  the  P  >  PQ 
problem.  Since  the  proposed  method  has  not  been  tried  as  yet,  1  am 
seeking  an  evaluation  of  the  method  along  with  alternate  approaches  to 
solving  the  problem.  1  will  now  entertain  questions  and/or  comments 
about  this  problem. 


A  COMPUTERIZED  PROCEDURE  FOR  WRITING 
MATHEMATICAL  MODELS  FOR  SYSTEMS  RELIABILITY 


John  G.  Mardo  and  Anthony  J.  Ricciardi 
Mathematics  and  Statistics  Branch,  Nuclear  Reliability  Division, 
Quality  Assurance  Directorate,  Picatinny  Arsenal 
Dover,  New  Jersey 

I,  ABSTRACT ,  A  method  for  the  determination  of  mathematical 
models  for  the  reliability  of  missile  adaption  kit  (AK)  systems  is  pre- 
sented.  The  method  entrusts  of  a  computer  program,  the  input  of  which 
ij  a  Boolean  expression  of  the  system  configuration.  The  program 
constructs  a  success-failure  tree  from  the  Boolean  expression  resulting 
in  all  possible  success  paths  for  the  system.  The  union  of  these  success 
paths  is  the  reliability  model  for  the  system.  The  number  of  components 
and  not  the  complexity  with  which  they  are  combined  limits  the  use  of  the 
present  procedure. 

II,  ACKNOWLEDGEMENT.  The  assistance  of  Bruce  Barnett  of  the 
Data  Processing  Systems  Office  of  Picatinny  Arsenal  in  the  development 
of  the  techniques  used  in  this  report  is  acknowledged.  In  particular,  his 
contributions  to  the  underlying  theoretical  aspects  are  appreciated. 
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Ill  Introduction 


The  purpose  of  mathematical  estimations  of  reliability  in  any  sta”? 
of  the  life  cycle  of  a  system  is  to  determine  the  expected'  probability  cf 
successful  functioning  in  use.  These  pst.1 me+.l  nn«  should  cnticipsts 
potential  reliability  problems  and  reveal  system  configurations  that  have 
greatest  probabilities  of  failure  in  use. 

An  important  tool  in  evaluating  these  estimates  is  the  reliability 
equation.  This  equation  is  a  mathematical  model  of  the  system  under 
consideration,  relating  the  reliability  of  the  system  to  the  reliability 
of  the  components  which  comprise  it.  For  complex  electrical  systems, 
these  equations  are  difficult  to  obtain.  The  difficulties  encountered 
are  dependent  upon  the  number  of  components  in  the  system  and  the  degree 
of  complexity  of  the  configuration. 

The  dependent  operations  of  the  various  components  require  the  use 
of  conditional  probabilitiea  in  developing  the  mathematical  models.  The 
determination  of  these  conditional  probabilities  is  difficult;  as  a 
result,  reliability  equations  for  complex  systems  are  usually  approximations 
based  on  the  assumption  of  independence.  Simplified  models  of  the  system 
are  presently  used  which  ignore  the  less  likely  modes  of  operation. 

Although  numerical  estimates  obtained  from  models  which  represent  the 
system  exactly  do  not  differ  markedly  from  those  that  would  result  from 
approximate  methods,  there  are  a  number  of  advantages  in  using  the  more 
exact  method.  Arguments  as  to  the  validity  of  the  model  used  for  the 
analysis  ere  largely  eliminated  because  the  "model",  in  this  case,  is  the 
most  complete  mathematical  representation  of  system  operation  possible. 

The  ability  to  handle  large  numbers  of  components  permits  breaking  down 
the  system  into  very  email  elements  and  the  reliabilities  of  these  small 
elements  can  be  established  with  greater  confidence  and  can  be  established 
by  testing  at  less  expense.  Finally,  the  equations  arising  from  this 
analysis  permit  component  effect  studies  on  a  more  realistic  level,  since 
a  more  exact  representation  of  a  component's  role  in  the  operation  of  the 
system  is  given  by  the  resulting  equation. 

An  automated  procedure  will  be  presented  for  analyzing  systems.  This 
procedure  results  in  a  reliability  equation  which  is  a  mathematical  model 
representing  the  system.  The  primary  purpose  of  this  automated  procedure 
is  the  determination  of  success  models  in  the  shortest  length  of  time  by 
the  most  economical  means.  Complicated  networks  require  months  of  manual 
effort  to  determine  reliability  models  even  with  the  previously  discussed 
approximations.  Using  the  computer  procedure  to  be  discussed,  it  is 
necessary  to  understand  the  logical  functioning  of  the  system.  *  With  this 
understanding,  it  will  reqire  only  a  few  days  rathor  than  months  to  derive 
the  final  algebraic  equation  using  computer  techniques. 
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IV  Foundations  of  the  Computerized  Procedure 

Consider  the  system  composed  of  the  components  Ci,  Ca,  Cn  arranged 
in  a  configuration  widen  makes  ordinary  parallel-series  reliability  analysis 
of  the  network  difficult.  Interdependency  of  component  opera  lion  causes 
such  a  situation.  It  follows  from  Beyefs  Theorem  that 

(1)  P<f)  -  P(f|Ci)P(C1)+P(f|c1)P(Ci) 

where:  P(f)  »  probability  of  the  system  functioning 

P(f|Ci)  -  the  probability  of  the  system  functioning  given  that 
component  operates  correctly 

?(C±)  -  probability  of  component  Ci  operating  correctly 

P(f|Ci)  and  P(Ci)  are  defined  similarly  where  Ci  represents 

the  event  where  component  Ci  fails  to  operate  correctly 

P(Ci)  is  the  reliability  of  the  component  Ci  and  P(Ci)  - 

1  -  P(Ci) 

Equation  (1)  would  be  the  desired  expression  of  the  system  reliability 
in  terms  of  the  component  reliabilities  if  the  conditional  probabilities 
P(f|Ci)  and  P(f|Ci)  were  evaluated  either  numerically  or  as  functions  of  the 
component  reliabilities.  The  computer  program  listed  in  Appendix  A  performs 
these  evaluations  of  the  conditional  probabilities  by  using  the  Boolean 
algebraic  expression  which  represents  the  logic  of  the  system  operation. 

This  expression  is  a  function  B(Ci,  cj,  . C*)  which  takes  on  the  values 
1  or  0  representing  system  success  or  failure,  respectively,  where  cj"  is  a 
variable  which  takes  on  the  value  1  or  0  depending  on  whether  component 
Cj_  operates  or  fails  to  operate,  respectively. 

-  It  is  possible  using  this_function  B  to  evaluate  the  conditional 
probabilities  P(f|d)  and  P(f|Ci).  If,  when  cf  is  given  the  truth  value 
1  in  the  Boolean  function  B,  and  all  other  Cj,  where  j  /  i,  are  given  truth 
values  n,  the  value  of  B  is  1,  then  P(f|Ci)  »  1.0.  However,  if  B  ■  0 
then  ?(f|Ci)  cannot  be  determined  directly  and  P(f|d)  must  be  further 
expanded  as  follows! 

P(f|Ci)  -  P(f  |Cj.Cj )P(Cj)+P(f|CiCj )P(CJ) for  any  j  /  i 


Similarly,  when  cf  is  given  the  value  Q.  in  B  and  all  other  cf,  where 
j  /  i,  ar&  given  values  1,  B  ■  0,  then  P(fjC^)  *  0.0.  However,  if  B  =  1, 
thou  F(fjCi)  utin  nuL  be  determined  directly  and  thus  may  also  require 
further  expansion  as  follows: 

P(f|Ci)  -  P(f|CiCj)P(Cj)+P(f|CiCj)P(Cj)  for  any  j  /  i 

At  this  point,  an  attempt  is  again  made  to  evaluate  the  conditional 
probabilities  using  the  function  B.  The  procedure  is  continued  until  all 
the  conditional  probabilities  have  been  eliminated  by  substitution  of 
either  their  numerical  equivalents  or  these  conditional  probabilities 
expressed  as  combinations  of  the  individual  component  reliabilities.  When 
this  point  is  reached,  the  P(f)  has  been  expressed  algebraically  as  a 
combination  of  the  individual  component  reliabilities  and  the  program  is 
terminated. 


Applying  this  procedure  to  the  following  circuit: 


the  Boolean  expression  for  the  circuit  is 


FIG.  1 


SYSTEM  =■  B(A, B,C)  »  (A+B)*C 
Expanding  as  described  above  using  Baye's  Theorem: 

1.  P(f)  -  P(f|A).P(A)+P(f|A>P(A) 

2.  P(f|A)  *  P(f|AB)*P(B)+P(f|AB>P(B) 

3.  P(flA)  *  P(f|AB)-P(B)+P(f|AB)-P(B) 

l*.  P(f | AB)  -  P(f|ABC)'P(C)+P(f|ABO)*P(C) 

5.  P(flAB)-  P(f |ABC) *P(C)+P(f | ABC) • P(C) 

6.  P(f | AB)  -  P(f| ABC)'P(C)+P(f IaBC)‘P(C) 


Flrom  the  Boolean  Expression  it  follows  that: 

P(f|AB)  -  P(f| ABC) ■  P( f | ABC)  -  P<f|ABC)  -  0  and  P(f |ABC)  - 
P(f| ABC)*  P(f|ABC)  »  1 
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P(f)  -  P(C)*P(B)«P(A)+P(A)‘P(C)*P(I)+P(n)*p(*).p(r) 
p(f)  ■  [P(B)-P(A)+P(AKl-P(B))+P(BKl-P(A))]-P(C) 
p(f)  -  [P(A)-P(B)+P(A)-P(A)*P(B)+P(B)-P(A).P(B)].P(C) 

P(f)  -  (P(A)+P(B)-P(A)‘P(B))‘?(C) 

the  algebraic  reliability  equation  for  the  circuit  shewn 

in  rXCU  If 


V  Application  of  Computer  Procedure 

The  following  example  demonstrates  the  computer  method  of  handling 
the  procedure  on  a  simple  circuit.  Consider  the  circuit  with  components 
A,  B,  Ci,  Di,  C2,  Eg  in  the  figure  below: 


The  Boolean  logic  diagram  is  converted  into  a  Boolean  expression 
using  Boolean  algebraic  techniques.  The  resulting  expression  for  the 
above  diagram  is  as  follows: 

(1)  STSTEM  -  A(Ci+BD2)+B(Di+AC2) 

This  expression  is  then  programmed  using  whatever  means  are  available 
in  the  programming  language  being  used. 

The  next  step  is  to  set  up  an  "order  of  consideration"  of  the  components. 
This  will  be  A,  B,  Ci,  Eg,  Di,  C8. 

The  steps  that  follow  are  handlpd  by  the  computer  as  follows: 

Using  the  Boolean  expression,  a  "tree"  is  generated  within  the 
computer.  Such  a  tree  will  now  be  generated  for  the  circuit  under  discussion. 

_Symbol  (A)  is  used  to  represent  the  success  of  component  (A). 

Symbol  (A)  is  used  to  represent  the  failure  of  component  (A).  Similar 
notations  are  used  for  components  B,  C,  and  D.  Starting  with  component  A 
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(since  A  .is  the  first  component  under  the  grder  of  consideration)  the 
associated  success-failure  symbols  (A  and  A)  are  used  as  the  first  two 
branches  of  the  tree.  To  determine  how  far  to  continue  a  branch,  each 
ViT»«nnVi  -t *  nr*  examined  using  the  following  rules: 

1.  Starting  with  the  A  branch,  assign  the  value  1  to 
component  (A)  and  the  value  C  to  all  the  remaining  components  in  the 
system.  Substitute  these  truth  values  into  the  Boolean  expression  for 
the  system  and  determine  whether  this  conibination  of  values  causes  a 
system  success  or  a  system  failure.  If  the  result  is  a  system  success, 
end  the  A  branch  of  the  tree.  If  the  result  is  a  failure,  plan  to 
continue  the  A  branch  by  adding  tho  two  branches  (B  and  B)  of  component  B. 

2.  Using  the  A  branch,  assign  the  value  0  to  component  (A) 
and  the  value  1  to  all  remaining  components  in  the  system.  Substitute 
these  truth  values  into  the  Boolean  expression  for  the  system  and 
determine  whether  this  combination  of  values  causes  a  system  success  or 
a  system  failure.  If  the  result  is  a  system  failure,  end  the  A  branch_ 
of  the  tree.  If  the  result  is  a  system  success,  plan  to  continue  the  A 
branch  by  adding  the  two  branches  (B  and  B)  of  component  B. 

3.  Continue  to  generate  the  tree  diagram  jy  adding 
components  and  testing  each  branch  of  each  component  for  termination  or 
continuation  as  described  above.  The  expression  that  describes  the 
success  path  to  the  last  component  in  the  branch  can  be  used  to  develop 
the  algebraic  equation  for  the  system. 

When  all  success  paths  have  been  generated,  the  program  creates 
an  algebraic  success  model  which  can  be  used  for  the  generation  of 
reliability  point  estimates  for  the  overall  network  described  by  the 
Boolean  expression. 

The  tree  diagram  for  the  circuit  of  FIG.  2  is  shown  in  FIG.  U  along 
with  the  resulting  success  paths. 
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_The  computer  program  then  saves  these  susscesa  paths,  substituting  for 
the  A,  B,  etc.,  (1-A),  (1-B),  etc.,  respectively  as  follows: 

it  (aiSTOl)  -  ABCi+AHUatl-Cij+ABDili-GiJCl-Da^+ABCaU-CxJll-DaJCl-Di) 
+AC1(l-B)+BC1DoDx(l-A)+BCiDi(l-A)(l-Da)+BDBDi(l-A)(l-Ci) 
+HD1(1-A)(1-C1)(1-DS) 

The  equation  for  R  (SYSTEM)  is  then  stored  in  computer  memory  as 
follows  (see  Appendix*  RELIABILITY  MODEL): 

R  (SYSTEM)  -  ABCi+ABDa-ABCxDa+ABDx-ABCiDi-AHDiDa+ABCiDiDa+ABCa-ABCiCg 

-ABCaDa+ABCiCaDg-ABCaDi+ABCiCaDi+ABCaDjjDa^BCiCaDiDa+ACi 

-ABCi+BCiDiDg-ABCxDiBg+BCiDi-ABCiDi-BCiDxDa+ABCiDiDa+BDxDa 
-ABDiDa“BC1DiD3+ABCiDiB3+BDi-ABDi-BCiDi+ABCiDi-BDiDa+ABDiDa 
+BC 3D iD  a “ABC ]D i D  a 

Noting  that  A  and  B  are  similar  components  which  will  always  have  the 
same  function  and  the  same  reliability  as  will  Cj,  C2,  Di,  and  Da  the  above 
equation  will  be  reduced  to  the  following  by  the  computer: 

R  (SYSTEM)  ■  A2C+A2C-A2C2+A2C-A2C2“AaCa+AaC3+AaC“A2C2“AaC2+AaC3“A3C8+A2C3 
+A8C3“AaC4+AC“AsC+AC3-AaC3+AC2“A2C2-AC3+AaC3+A2C2“AC3+AaC3 
+AC-A8C-AC2+A2C2-AC2+A2C2+AC3-A2C3 

The  "combine  terms"  routine  is  then  applied  to  obtain  the  final  result, 
the  algebraic  success  model  for  the  circuit  in  REG.  2  or  any  circuit 
represented  by  the  logic  diagram  of  FIG.  3. 

R  (SYSTEM)  -  2A2C-6A2C2+l4A2C3-AaC*+2AC 
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VI  Conclusion 


The  program  to  carry  out  the  procedure  described  above  has  been 
developed  and  tested  on  many  hypothetical  systems.  (See  Append  Joe)  Results 
of  this  testing  brought  to  light  a  few  drawbacks  to  the  method.  These 
weaknesses  will  now  be  discussed. 

On  a  system  consisting  of  N  distinct  components,  the  number  of  branches 
which  may  be  considered  is  2^.  This  number  may  be  reduced  greatly  if  a 
proper  order  of  consideration  of  components  is  used.  The  procedure  is 
extremely  sensitive  to  this  order  end  efforts  are  now  being  made  to  deve  !-,p 
decision  mechanisms  within  the  program  to  construct  non-redundant  success 
paths  which  result  from  improper  order  of  consideration.  An  illustration 
of  this  redundancy  can  be  shown  on  the  demonstration  circuit  used  above. 
Because  of_the  order  A,  B,  Ci,  etc.  used  two  success  paths  which  result  are 
ABCi  and  ABCi.  The  same  contribution  that  these  paths  make  to  the  final 
reliability  equation  would  have  resulted  had  the  order  been  slightly  altered; 
i.e.,  A,  Ci,  B,  etc.  The  only  success  path  resulting  from  this  order  would 
have  been  ACi  and  P(ACi)  -  P(A)P(Ci).  Notice  that  P(ABCi)+P(ABCi)  «  P(A) 
P(B)P(Ci)+P  (A)P(B)P(Ci)  -  P(A)P(B)P(C1)+P(A)P(C1)-P(A)P(B)P(Ci)  -  P(A)P(CX). 
Hence,  this  change  in  order  eliminates  the  use  of  two  branches  to  come  up  with 
the  same  contribution  to  the  final  algebraic  success  model.  The  presence  of 
each  causes  unreasonable  amounts  of  computer  time  to  be  used  even  when  only 
point  estimates  rather  than  the  algebraic  equations  are  being  computed. 

A  second  problem  is  caused  by  the  need  for  large  amounts  of  computer 
storage.  This  need  arises  only  when  the  algebraic  equation  is  being  sought, 
since  each  success  path  must  be  stored  in  some  manner  so  that  final 
refinement  of  all  success  paths  as  a  whole  can  be  made  to  determine  the  final 
model  in  a  well  organised  form.  When  only  a  numerical  point  estimate  is 
sought,  there  is,  in  general,  no  need  to  be  concerned  about  memory  size. 

The  third  and  final  problem  is  a  minor  one.  It  results  from  the 
cumulative  round-off  error  that  is  present  when  many  accumulative 
multiplications  are  performed  with  very  small  numbers  while  generating 
numerical  estimates  of  reliability.  This  problem,  however,  has  largely 
been  overcome  due  to  the  availability  on  most  present  day  computers  of  the 
double  precision  variable. 

The  present  stages  of  development  of  the  procedure  are  concerned 
primarily  with  overcoming  these  difficulties.  When  the  flaws  pre  eliminated, 
the  computer  program  will  provide  to  the  engineers  a  means  of  predicting  and 
estimating  the  reliability  of  their  systems.  It  will  provide  engineering 
with  the  efficiency  and  accuracy  of  the  computer  in  determining* the 
reliability  success  models  it  requires,  saving  a  good  deal  of  time  and  money. 
Reliability  equations  that  previously  require  months  to  derive  manually,  can 
now  be  solved  in  a  matter  of  days. 


302 


VII  References 


1.  Moskowits,  F.  ,  "The  Nature  and  Behavior  of  Redundancy  Networks  and 
Functions, "  Rome  Air  Development  Center,  Oriffisa  AFB,  New  York, 
RADC-TR-60-U,  May  I960 

2.  Moskowite,  F.,  "Properties  of  Redundancy  Networks  with  Equal  Element 
Reliability  Values,"  RADC-TN-58-1U6,  June  1958 

3.  Moskowitr,  F. ,  "The  Analysis  of  Redundancy  Networks, "  Communications 
and  Electronics  AIEE,  pp,  627-632,  November  1958 

U.  Weinstock,  George  D.,  "Topological  Analysis  of  Non-Series-Parallel 
Redundant  Networks,"  RCA  Surface  Communications  Systems  Laboratories, 
May  1962 


Appendix  A 

Included  in  this  appendix  are  a  listing  of  the  computer  program 
discussed  in  the  body  of  this  report  and  an  example  of  the  program  output. 
This  output  resulted  from  the  application  of  the  computer  procedure  on  the 
single  dircuit  discussed  in  Section  V  of  this  report.  Note  that  a 
reliability  point  estimate  vres  generated  for  a  given  set  of  component 
reliability  values,  as  well  as,  the  final  reliability  model. 


PROGRAM  LISTnrO 


t)|Mf  NS  I  ON  !COMP!  30)  ,  IRRNCHn^  ),  TRUTH)  30  )  ,  FLFMNT  ( 3"  I  •  !  KI3IJNT I  ?**  "  )  , 
1ITFRMS!  31*%3'Mt«;!GN(3D0>  ,  AF  ACTO!  S'1'  ,3'I.Hm 
DUUBLF  PRECISION  B(  3*')  ♦PROR.PROBl ,  PROR* 

DATA  Q/'C!'Hl /4H  / 

BLANK*  Q"r,',HL 
DATA  Q* 01HL/4HN  / 

NEGATF*  Q^IHL 
DATA  Q"'.'?HL/4H  ♦  / 

PLUS*  QOOPHL 
DATA  0DO3HL/4H  -  / 

TlNUS*  0 AHL 
WRITE  (6, Adi 
6f-l  FORMAT!  JHU 

1  *'  READ  !5»5‘'M  Alt  I  CODE  » ISFCT 
SC-  FORMAT  (  )(  I  3,  ?  X  »  ) 

C  JCQDt  **1  CALCULATE  POINT  ESTIMATE  ONI  Y 
C  ICDOE  ■  "  FOLIATION  (INLY 
C  ICDOE  «-l  DOTH 

IF<  !CnnElRC,39,9| 

*9  KFAD  (5.5*  1)  (FlfMNT (LISTI «  LIST*1.N) 

5  31  FORMAT ( I BA4) 

DO  V*  I  LI  ST* 1 »  N 

9  WRITE!  6.59R)  Fir«NTULlST),H  1ST 
599  FI|RMAT(  H".3DX,1  WCOMPONCNT  ,  A4.950  I c  REPRFSPNYPn  AS  VARIABLE  NH“ 
1RER  ,1?) 

91  f  F ( ICODE ) 9?*  93  »  92 
9?  READ!5f5('2HD<KM)»KM*ltN) 

WRITS  15, 5<i9»N,(  A(KN1  ,KN«!  ,N> 

5Q9  FORMAT  (  I  H'V  /  54  X  f  2?HN'J  «R  FR  Of  COMPONENTS  ■  »  I?  ,/54x»?4H - - - 

l - //(  IH'-.A  (9X,01«.  1?)  )/) 

C  THT  AOOVF  STATEMENTS  HANDLE  THE  TABULATING  OF  fOMPONCNT  SYMBOL  RFn»F- 
C  SENTAT  ION,  RFI.  I A  A  I L  I  T  I  FS  OR  BOTH. 

9B  ITFRM  *  " 

ICOMP(ll»  1 
K-l 

b  IBRNCH(K)-! 

5  I COMP ( K* 1 1  * l 
m»k  f  2 

DO  1^  I *M , N 
k  icompid-o 
no  1 9* i * N 
TOf  L(  I9)«ICDMP( 19) 

IBOnLP«ICOMPU  )  *(  I  COMP  ( 2  )*ICf:MP(4)  MCOMPO)  M-ICOMPI  ?)*!  !COMP|  1  »* 

1  IC0MP(A)MCQMP!  5)  ) 

IFUBD0LF)7lf  7C  ,?1 
7*  I F !  K-N* 1  I  13. 14# 14 
1 4  WRITE! 6.61"'^) 

AO"  FORMAT! 15X.46HN0  SUCCESS  PATH  EXISTS  FOR  CIRCUIT  CONSTRUCTED ) 

GO  TO  FT 
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13  K>K+1 

go  rn  6 

71  I  TERM  »  [ TERM  ♦  1 

nn  40  f M«t.M 

m ICOMP! IM1I4?, 41,47 
42  Ll-IM 

TRUTFIUMI -BLANK 
GO  TO  40 

41  TRUTH! IN) -NEGATE 
40  CONTINUE 

IF ( I CODE  173,74,73 

73  PR OB 2  «  l,f 

00  BO  IT2-1,L1 
IfUCOMP!  IT2MBJ  ,82,81 

81  PR0B1-B! IT2) 

GO  TO  9* 

82  PROBI-1.0  -  9 ! |T?) 

8C  PROB2«PROR2*PRPBl 

PROB  «  PROB  ♦  PR0B2 
I F  (  ICODE  )  74,  2,  ?. 

74  DO  201  INDEX-i, LI 

2C1  I  TERNS ( 1 , I NDEX ) -I NDEX 
KOUNT-O 
K2-1 

199  IF ! TRUTHIK2)  -  NEGATE ) 2^ 3, 2C ? , 
702  KOUNT  «  KOUNT  ♦  1 
l KOUNT ! KOUNT I  -  K? 

198  K2  •  K2  ♦  1 
GO  TO  199 

203  IF(K 2-1 1)199,2^4,198 
2C4  IF!K0UNT)197,?rs,197 
2r5  NOTFRM-  1 
NUM- 1 

SIGN! I ) -PLUS 
GO  TO  194 

197  NOTFRM  ■  2**K0UNT 
NOSIGN  -  2*N0TERM 
DO  2^6  INDICE  •  1, NOSIGN 
206  SIGN! INOICEI-  PLUS 
NUM- 1 
KOUNTl 
ISTAGF  -1 

172  MOUNT  -KOUNTl+l 

KOUNTl  -K0UNT1  ♦  2** ( I  ST  AGE  -  1) 
DO  180  INUM-MOUNT, KOUNTl 
NEXT  ■  I KOUNT 1 1  STAGE ) 

NUM  >  NUM  *1 

IF! NEXT- 1)181,1 82, 181 
181  NEXTl-NEXT-l 

DO  15**  INDEXl-l  ,NEXTl 
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150  ITFRMSINUM,  I  NOE  XI » »  I TFRHSC  INUM,  tNOFXl) 

182  itermsinum.next)*  r. 

NEXT2  -  NEXT  ♦  1 

DO  151  INDEX?  «  NFXT2  »Ll 

151  1  I tK MS  I  NUN, INDEX?)  ■  l TERMS! I NJM, INDEX  2) 
SIGNINUMI-SIGNI INUM) 

NUM«NUM  ♦  1 

IFISIGN<INUM)-TFNUS) 152,  16?, 15? 

152  SIGN ( NUN)  «  TINUS 
GO  TO  17* 

162  SIGNINUMI -PLUS 

17*  IFINEXT-1 )19l, 182,101 

151!  00  153  INDFX3«l,N6XTl 
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BEST  FITTING  LINF.aR  VARIETIES 


Robert  m.  Thraii 
Willow  Run  Laboratories 
University  of  Michigan,  Ann  Arbor,  Michigan 

1.  INTRODUCTION.  We  consider  a  generalization  of  the  classical 
problem  of  finding  the  best  fitting  linear  function  for  a  set  of  data.  The 
results  obtained  are  stated  in  the  language  of  eigenvalues  and  principal 
components  and  take  a  form  which  is  not  explicit  in  the  usual  textbook 
treatments  of  principal  components.  In  1901  Karl  Pearson  in  his  paper 
"On  Lines  and  Planes  of  Closest  Fit  to  Systems  of  Points  in  Space" 
(London  Philosophical  Magazine,  Sixth  Series,  Vol.  2,  1901,  pp.  559-572) 
stated  and  solved  the  problem  for  ordinary  three  space.  The  texts 
M.  J.  Kendall,  A  Course  in  Multivariate  Analysis,  and  T.  W.  Anderson, 
Multivariate  Statistical  Analysis,  treat  the  standard  principal  component 
theory  and  give  useful  numerical  examples.  R.  Bellman  (Introduction  to 
Matrix  Analysis,  McGrawHill,  New  York  I960,  pp.  1 1 3- 1 15)  develops 
the  same  topic  from  a  slightly  different  point  of  view  using  the  Courant- 
Fischer  min-max  Theorem. 


2.  SOME  ALGEBRAIC  BACKGROUND.  Let  V  =  be  the  space  of 

column  vectors  of  degree  k  over  the  real  field.  A  sequence  C^,.  ..  ,C 
of  vectors  in  V  is  said  to  be  orthonormal  if 


C.  a 
J 


^0  if  i  /  j 
^1  if  i  =  j 


(i,  j  =  1, . .  .  ,n). 


(We  use  the  superscript  T  to  denote  matrix  transposition.)  A  matrix 

C  =  [C.  . .  .  C  ]  is  said  to  be  orthonormal  if  its  columns  constitute  an 
1  r 

orthonormal  sequence  or  equivalently  if  C  C  =  1^. 

A  subset  W  of  V  is  said  to  be  a  subspace  if  it  is  closed  under 
addition  and  multiplication  by  scalars,  a  subset  M  of  V  is  said  to  be 
a  linear  manifold  if  it  has  the  form  M  =  W  +  X  =  {X  +X  |XtW}  for  some 

subspace  W;  i.  e.,a  linear  manifold  is  just  the  parallel  displacement  of 
a  vector  space.  For  any  matrix  C  we  denote  by  L*(C )  the  set  (subspace) 
of  all  solutions  of  the  equation  CX  =  0  and  denote  by  L(C)  the  subspace 
consisting  of  all  linear  combinations  of  columns  of  C.  For  any  subspace 
W  there  exist  matrices  A  and  B  such  that  W  =  L!‘(A)  =  L(B).  If 
dim  W  =  r  we  may  assume  that  A  has  shape  (k-r)-by-k  and  that  B  has 


shape  k-by-r  ;  we  may  also  assume  that  A  and  B  are  both  orthonormal. 

A  square  orthonormal  matrix  P  is  said  to  be  orthogonal  ;  for  a!hy 
non- square  orthonormal  matrix  C  there  is  a  second  orthonormal  matrix 
D  such  that  P  *  [CD]  is  orthogonal  and  for  which  L(C)  *  L.*(dT). 

A  k-by-k  matrix  A  is  said  to  be  positive  semi-definite  if  X^AX  >  0 

T  ' = 

for  all  X  in  V;  if  also  X  AX  =  0  implies  X  ■  0,  A  is  said  to  be. positive 

definite.  If  A  is  positive  semi -definite  there  exists  an  orthogonal  matrix 

P  such  that 


A=  ptap  = 


is  a  diagonal  matrix  whose  diagonal  entries  satisfy  the  condition 

a.  ^  A.  ^  ^  A.  >  0  . 

1=2=  =  k  = 

The  numbers  are  called  eigenvalues  of  A  ;  a  positive  semi- 

definite  matrix  is  positive  definite  if  and  only  if  X^  >  0.  The  j-th  column 
Pj  of  P  satisfies  the  condition  AP^  «*  X.  P  and  is  called  an  eigenvector 
for  A  belonging  to  the  eigenvalue  X^(j  =  1, .  . .  ,k). 

If  A  is  any  k-by-k  matrix  we  define  the  trace  of  A  by 
tr  A  =  an  +  *22  +  ••■  +  *kk  ■ 

If  a  product  BC  of  two  matrices  B  and  C  is  square  then  so  is  CB  and 


tr  BC  **  tr  CB  . 


A  matrix  G  is  said  to  be  a  projection  if 

T 

GG  =  G  . 
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If  C  is  orthonormal  then  CC  is  a  projection. 

* 

If  W  =  L(C)  ujjhere  C  is  a  k-by-r  orthonormal  matrix  the  projection 
of  any  Vector  X  on'*W  is  the  vector 

Xo  *  2ir.l  <xTci>  / 

T 

then  X  -  X  is  perpendicular  to  X  ,  (X  -  X  )  X  =  0)  and  the  squared 
.  o  o  ■  o  o 

distance  d(X,  W)2  from  X  to  W  is  given  by 

* 

d(x,  w)2  =  (x  -  X0)T(X  -  xo)  .  xT  x  -  x2  xo  ; 

■  xTx-r.i  Ejr.i  (XT  C.)  (XT  C.)  c2  c. 

•  XTX  -  =  ].,(xTc.)2 

T  T  T 

=  XX  -  X  GC  X 


xT  (I  -  ccT)x  . 


Let  [  CD]  be  an  orthogonal  matrix  where  D  has  s  =  k  -  r  columns.  ] 

Then  j 

i 

I  =  [CD]  [CD]  T  =  CCT  +  DDT  ] 


so  that 


2  T  T 

d(X,  W)  =  X  DD  X 


and,  moreover,  W  *  L(D), 

Next,  let  B  *  [B^  . .  .  B^]  be  a  k-by-n  matrix  and  let 
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■  /H  ■ 


d(B,  W)2  =  1  d(B.,  W)2 


=  2“  B*  DDAB. 
i*l  i  i 


since 


Y'  4  *, 


T  T 

=  tr  B  DD  B 


»  *  '  *  4 

•  #  i  * 

’  T 

. 

1  *  * 

B  f 

j 

(BTD)(DTB)  = 

p  •* 

* 

• 

I 

T 

B  D 
■  n 

*  [dtb  . . .  dtb  ]  = 

1  nJ  i 

T  T 
B^DD^ 

Now,  since  tr  is  a  symmetric  function,  we  have 


d(B,-W)2  =  tr  BT  DDTB  =  tr  DT  BBTD 


Next,  let  M  be  an  orthogonal  matrix  for  which 

X, 


hiT(BBT)M  = 


'K 


-  A 


where  A,  >  A..  >  .  . ,  >  A.  >  0  . 
1=2=  =  k  ■ 


Then 


d(B,  W)2  =  tr  (MTD)T(MT  BBTM)  (MrD) 
=  tr  D|TAd'  *  d(A»  W')2 


where  D'  =  M^D  is  also  orthonormal  and  W'  =  L  (D1^)  . 

3.  THE  BEST  FITTING  LINEAR  SPACE.  We  now  state  our  problem. 
Given  B,  find  the  space  W  of  dim  r  which  minimizes  d(B,W)  ,  the  sum 
of  the  squared  distances  from  the  columns  B  to  W. 
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We  see  that 


miri  {d(B,W)*  |  dim  W  =  r}  *  min{tr  DT  BBTE>  J  D  orthonormal 

of  rank  s  *  k  -  r} 

T  A 

=  min{tr  D'  A.  D'  |  D'  orthonormal 
of  rank  s  =  k  -  r}  . 


We  now  show  that 


Dp  *  [  ^  ]  minimizes  tr  D'  tAd  , 

s 

min  {d(B,  W)2  |  dim  W  =  r}  =  Ap+1  +  .  .  .  +  \ 


and  for  the  minimizing  space  Wp  we  have 

(3)  Wp  =  L*(Dq)  -  L(Cq) 


where  M  r  [CpDp]  is  the  partitioning  of  M  into  its  firBt  r  and  last 
s  columns. 

Thus,  we  conclude  that  W  is  the  space  spanned  by  the  r  eigen¬ 
vectors  with  largest  eigenvalues. 

Since  (2)  and  (3)  follow  at  once  from  (1),  we  need  only  establish  (l): 


D'TAd'  =  [Ff...FT]  \ 

1  n 


k  T 
=  L  A  F  F 
ial  V  i  *  i 


where 

Then, 


e  Fj,  is  the  i-th  row  of  D'(i=l,  . .  .  ,k). 

,  since  D'  is  orthonormal,  0  <  y  <  1. 

&  1  B 


Next,  let  y.  =  F^F.<i=l,  .  .  .  ,k). 
Moreover, 


yi  =  tr  D'TD'  a  tr  D'  D'T  =  tr  Ig  =  s  . 
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Thus, 


mintrD'  D'  >  min  {2A{y.  |  0  <  y.  <  1,  y,  + 

T*%  •  ***  .  *  .  “  ^  _ 


+  . . .  +  yk  =  s }  . 


This  simple  linear  programming  problem  has  the  solution  y  ■  . . ,  *  y  =  o, 

1  r 

yy+1  *  . . .  *  *  1  and  since  these  y,  are  realized  by  D^,  (1)  is 

established. 

4.  BEST  FITTING  LINEAR  MANIFOLD.  Any  linear  manifold  M 
of  dimension  r  can  be  written  in  the  form 

M  =  L(C)  +  h  C 


where  [CqC]  is  an  orthonormal  matrix  and  W  *  L(C)  is  the  related 

linear  space.  To  calculate  the  distance  from  any  vector  X  to  M  we  first 

find  the  unique  vector  Y  in  M  for  which  X  -  Y  is  orthogonal  to  M  or, 

equivalently  to  W.  Let  X  be  the  projection  of  X  on  W  as  defined  in 

Section  2  above.  Then  X  is  orthogonal  to  W  and  since  C  is  also 

o  o 

orthogonal  to  W  the  vector  X  -XQ  -hCQ  is  also  orthogonal  to  W.  But 

X  +  hC  is  in  M;  hence  Y  =  X  +  hC  .  Thus  we  have 
o  o  o  o 

d(X,M)2  =  (X  -(Xo  +  hC  )f{X  -  (X  +  hC  )) 


=  (x  -  Xo)T(X  -  Xo).  2h(X  -  Xo)TCo  + 


2  T 

+  h  C  C 
o  o 


Referring  to  the  notations  and  calculations  in  Section  2  and  using  the 
fact  that  [CqC]  is  orthonormal  this  can  be  written  as 

d(X,M)2  =  XT  DDTX  -  2hXT  C  4  h2  . 

o 


Now,  let  A  =  [A , .  .  .  ,  A^]  be  any  k-by-n  matrix.  Then  the  sum  of 
the  squared  distances  of  the  columns  of  A  to  M  is  given  by  (cf  Section  3) 
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d(A,  M)2  =  tr  ATDDTA  -  2nkAT  C  +  nk2 

o  o 

•  *  T  T  T  7  T  7 

~  tr  A'DD  A  +  n(h  -  Aq  Cq)~  -  n(AQ  Cq) 


where  nA  =  £  A  ■  i.e.  A  is  the  mean  of  the  vectors  A . A  . 

o  i  o  I  n 

From  this  formula  it  is  clear  that  for  any  choice  of  the  matrix  C, 

d(A,M)  is  then  minimized  by  taking  h  =  A^C^  and  choosing  Cq  orthogonal 

to  W  and  so  as  to  maximize  aJcq.  This  is  clearly  acheived  by  taking  Cq 

as  the  unit  vector  in  the  direction  of  the  projection  of  Aq  on  L(D)  (the 

orthogonal  complement  of  W).  Then  the  projection  of  A  on  L>(D)  will  be 

T  0 

(A  C  )C  a  hC  and  we  conclude  that  the  minimizing  linear  manifold 

contains  the  mean  A  of  the  columnjof  A.  We  use  this  fact  to  reduce  the 
. ■'■'*'  1  — . .  o  — — 

best  fitting  manifold  problem  to  the  best  fitting  vector  space  problem  which 

we  have  already  solved. 

Clearly,  for  any  vector  Z, 


d(X,M)2  *  d(X  -  Z,  M  -  Z)2 


In  particular  for  Z  =  A^  we  have 

d(X,M)2  =  d(X  -  Ao>  W)2 

and  hence 

d(A,M)2  a  d(B,W) 

where 

B  =  t(Al  -  Ao)  .  .  •  <An  -  Aq)  ] 

is  obtained  by  subtracting  A  from  each  column  of  A.  Hence,  if  W  is 

o 

the  best  fitting  linear  space  for  B  then  M  =  W  +  A  is  the  best  fitting 
linear  manifold  for  A.  ° 
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5.  SUMMARY.  The  results  of  the  preceding  two  sections  can  be 
summarized  as  follows.  Let  A^, . .  .  ,  A^  be  any  n  vectors  in  k- space, 

iet  A  be  the  matrix  whose  columns  are  these  vectors,  iet  A  be  the  mean 

o 

of  the  n  vectors,  and  let  B  be  the  matrix  whose  columns  are  A^  -  A^,  .  .  .  , 

A^  -  Aq.  Then  the  best  fitting  linear  space  W^(A)  of  dimension  r  for 

A., ...  i  A  has  a  basis  the  eigenvectors  corresponding  to  the  r  largest 
*  n  T 

eigenvalues  of  AA  and  the  sum  of  the  squared  distances  of  the  vectors  to 
this  space  is  the  sum  of  the  k  -  r  smallest  eigenvalues  of  AA^.  (in  the 
case  of  equal  eigenvalues  the  generating  eigenvectors  must  be  independent 
but  this  is  guaranteed  if  they  are  selected  as  columns  of  an  orthonormal 
matrix  as  above.) 

The  best  fitting  linear  manifold  M  (A)  of  dimension  r  for  these 

vectors  is  then  W  (B)  +  A  and  the  sum  of  the  squares  of  the  distance 
r'  o  T 

is  the  sum  of  the  k  -  r  smallest  eigenvalues  of  BB  . 

If  one  wishes  the  average  squared  distance  from  the  vectors  to  M  (A) 
the  number  above  is  divided  by  n.  This  can  be  acheived  alternatively 

by  using  the  matrix  G  =  The  result  is  that  M^(A)  =  Wf(G)  +  A^ 

and  the  average  squared  distance  is  the  sum  of  the  k  -  r  smallest  eigen¬ 
values  of  the  covariance  matrix  of  GG^  =  (*/n)BB^\  Suppose  that 

M  *  [M, ....  M.  1  is  an  orthogonal  matrix  for  which 
1  h 


T  T 

M  GG  M  = 


where  >  .  . .  >  >  0.  Then  the  columns  M^,...,M^  are 

called  the  principal  components  of  the  distribution  A^, .  . .  ,  A^,  and  the 
first  r  principal  components  constitute  a  basis  for  W^G). 

What  is  usually  stated  in  statistical  texts  is  that  the  first  principal 
component  gives  the  best  fitting  line;  that  the  second  principal  component 
gives  the  best  fitting  line  orthogonal  to  the  first;  and,  in  general,  that  the 
r-th  principal  component  gives  the  best  fitting  line  orthogonal  to  the  space 
generated  by  the  firBt  (r  -  1)  principal  components.  It  is  not  stated  expli¬ 
citly  that  the  first  r  principal  components  give  the  best  fitting  space  of 
dimension  r. 
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PLANNING  AND  ANALYSIS  OF  NON -EXPERIMENTAL  STUDIES'!' 


W.  G.  Cochran 
Harvard  University 
Cambridge,  Massachusetts 

1.  INTRODUCTION.  During  the  past  20  years  a  marked  increase 
in  statistical  studies  of  human  populations  has  taken  place.  Several 
reasons  for  this  can  be  suggested.  Successful  applications  of  operations 
research  during  World  War  II  led  to  an  expanded  use  of  this  technique  in 
business  and  marketing  after  the  war.  Public  opinion  polls,  which  proved 
interesting  and  informative  as  news  media,  stimulated  the  growth  of 
agencies  equipped  to  take  sample  surveys  for  clients.  The  provision  of 
increased  amounts  of  money  for  field  research  in  the  social  sciences  also 
contributed. 

In  many  of  these  studies,  the  objective  is  primarily  descriptive --to 
get  the  basic  facts  about  some  problem.  Examples  are  the  monthly 
estimates  of  numbers  of  employed  and  unemployed,  or  a  survey  undertaken 
in  a  city  to  estimate  the  amount  of  delinquency  among  teenage  boys  according 
to  some  definition  of  this  term. 

In  other  investigations,  interest  focuses  on  the  study  of  relationships. 
For  my  purposes,  I  should  like  to  distinguish  two  classes  within  this  type, 
although  they  shade  into  one  another.  The  first  class  consists  of  broad 
analytical  surveys  in  which  a  number  of  variables  are  being  investigated 
simultaneously  by  multiple  classification  or  multiple  regression,  or  by 
setting  up  models  involving  systems  of  equations,  as  in  econometrics.  For 
instance,  in  a  recent  study  organized  by  the  U.  S.  Office  of  Education  [l  ], 
standard  tests  were  given  to  school  children  in  grades  1,  3,  6,  9  and  12. 

By  multiple  regression  methods,  estimates  were  obtained  of  the  contribu¬ 
tion  made  to  the  child's  performance  by  various  characteristics  of  the  school 
attended,  by  the  home  environment  and  parental  attributes,  and  by  the  child's 
aspirations  and  self-concept. 

When  these  studies  are  exploratory,  the  discovery  of  the  relationships 
that  are  present  suggests  the  question;  Why?,  leading  the  investigator  to 
set  up  plausible  hypotheses  about  the  causal  forces  at  work.  In  other 
studies,  causal  hypotheses  may  already  have  been  proposed,  the  purpose 
of  the  study  being  to  verify  whether  the  predictions  about  relationships  made 
from  a  casual  model  are  consistent  with  the  results. 


’i'This  work  was  facilitated  by  a  grant  from  the  National  Science  Foundation 
(GS-341). 


My  second  class  of  analytical  surveys  is  narrower  in  scope  and  more 
intimately  bound  up  with  the  idea  of  cause  and  effect.  The  investigator 
tuiiteiiiioics  un  a  apecific  pre uumeu  cauaai  ageni  anti  tries  Lo  measure 
certain  aspects  of  its  effects.  Examples  are  the  effects  of  wearing  lap 
seat  belts  on  the  amount  and  types  of  injury  sustained  in  auto  accidents 
the  effects  of  air  pollution  on  illness  associated  with  the  respiratory  organs, 
the  effect  of  a  new  contraceptive  device  on  the  birth  rate  during  the  next  five 
years,  and,  to  cite  a  World  War  II  study,  the  effect  of  bombing  on  the  morale 
of  the  bombed  people. 

These  studies  resemble  controlled  experiments,  because  we  set  out  to 
measure  the  effects  of  certain  'treatments -the  causal  agents.  However, 
in  the  'non-experimental '  studies  with  which  I  am  concerned,  the  investigator 
is  unable,  for  practical  or  ethical  reasons,  to  use  the  two  chief  weapons  of 
controlled  experimentation.  He  cannot  select  the  subjects  who  are  to  receive 
the  causal  agent  and  the  subjects  from  whom  it  is  to  be  withheld.  If  the 
agent  is  one  that  may  be  present  in  greater  or  less  amount,  as  with  air 
pollution  or  bombing,  he  has  no  control  over  these  amounts,  but  must  take 
them  as  he  finds  them. 

The  design  and  analysis  of  controlled  experiments  has  become  iairly 
well  categorized  and  standardized.  Most  university  courses  on  the  subject 
discuss  completely  randomized,  randomized  blocks,  and  latin  square  plans 
(sometimes  under  different  names)  and  go  on  to  factorial  experimentation 
and  to  techniques  for  estimating  response  surfaces.  This  standardization 
brings  with  it  the  usual  benefit  of  economy  of  effort;  once  learned,  the 
techniques  of  planning  and  analysis  can  be  applied,  often  with  only  minor 
variations,  in  widely  different  areas  of  research, 

With  non-experimental  studies  much  less  standardization  of  this  type 
has  occurred.  There  is  less  cumulative  experience  with  the  various  types 
of  study  plan.  In  the  principal  fields  in  which  these  plans  are  used-- 
sociology ,  psychology,  education,  market  research,  and  public  health-- 
workers  have  only  recently  begun  to  learn  from  one  another.  Statisticians 
have  shown  limited  interest  in  the  logical  structures  of  the  plans. 

While  non-experimental  studies  present  many  issues  that  merit 
discussion,  this  paper  will  be  confined  to  three  topics,  as  follows. 

Some  preliminary  aspects  of  planning. 

Simple  types  of  study  plan. 

Techniques  for  increasing  precision  and 
eliminating  bias. 
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2.  PRELIMINARY  ASPECTS  OF  PLANNING.  Being  unable  to  apply 
the  causal  agent  in  which  he  is  interested,  the  investigator  in  a  non- 
experimental  study  must  first  find  some  locale  in  which  the  agent  is 
operating  or  will  operate  under  conditions  suitable  for  measuring  its 
effects.  In  this  search  the  following  questions  must  be  kept  in  mind,  all 
of  them  matters  of  judgment  rather  than  of  black  and  white. 

1.  Is  the  cause  operating  in  sufficient  strength?  Sometimes, 

for  reasons  of  convenience  or  expense,  the  investigator  chooses  an  environ¬ 
ment  in  which  the  causal  force  operates  too  weakly  to  allow  its  effect  to  be 
measured  in  the  size  of  sample  that  is  feasible.  For  instance,  airline 
pilots  might  be  considered  a  convenient  source  from  which  to  study  predic¬ 
tors  of  heart  disease,  since  they  receive  repeated  and  thorough  medical 
examinations  of  which  records  are  kept.  On  the  other  hand,  one  of  the 
criteria  by  which  they  are  selected  is  that  they  are  the  kind  of  men  who  are 
unlikely  to  develop  heart  disease. 

2.  What  other  important  variables  are  present  whose  effects  may 
be  confounded  with  those  of  the  causal  variable?  How  will  they  be  handled? 

In  planning  a  study  of  the  effects  of  air  pollution,  an  investigator  might 
look  for  three  residential  areas  in  the  same  city,  one  heavily  polluted,  one 
moderately,  and  one  relatively  free  from  pollution.  But  it  is  likely  that 
the  residents  of  these  areas  will  show  a  sizeable  gradient  in  socioeconomic 
levels,  which  might  account  for  any  differences  found  in  respiratory  illness. 

If  the  investigator  confines  himself  to  areas  closely  similar  in  socioeconomic 
level,  he  may  find  that  the  differences  in  amounts  of  air  pollution  are  quite 
small,  thus  becoming  involved  in  the  difficulty  mentioned  in  point  1.  Methods 
for  handling  confounded  variables  are  discussed  later  in  this  paper.  If, 
however,  an  important  variable  is  too  highly  correlated  with  the  causal 
variable,  as  might  be  the  case  in  the  air  pollution  example,  there  may  be 

no  way  to  disentangle  their  effects. 

3.  What  measurements  are  to  be  taken?  What  is  known  about  the 
precision  and  accuracy  of  the  measurements?  Marty  aspects  of  human  life 
and  behavior  present  formidable  problems  of  measurement*  e.g.  ,  how 
does  one  measure  morale?  In  large  studies,  the  measurement  process  may 
be  restricted,  for  reasons  of  expense,  to  responses  on  a  printed  question¬ 
naire.  Substantial  biases  in  measurement  can,  of  course,  produce  badly 
misleading  results.  "Random"  errors  of  measurement  of  the  effects 
decrease  the  precision  of  the  results.  "Random"  errors  in  measuring  the 
strength  of  the  causal  variable  (e.  g.  number  of  cigarettes  smoked  per  day) 
will  produce  an  underestimate  of  the  size  of  the  effect.  Similarly,  "random" 
errors  in  measuring  a  confounded  variable  decrease  the  effectiveness  of 

the  standard  statistical  methods  for  removing  the  disturbing  effects  of  this 
variable. 
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4.  If  the  study  is  to  be  made  from  records  already  collected  by 
someone  else,  have  the  records  been  checked  as  to  completeness,  accuracy, 
and  accessibility?  It  is  alw  „s  worth  considering  whether  a  study  can  be 
made  from  existing  records,  not  only  because  of  cost  but  because  this  may 
be  the  only  way  to  obtain  results  in  a  reasonably  short  time.  Sometimes, 
investigators  construct  plans  and  engage  tiLaff  for  a  study  ori  Lhe  basis  of 
someone's  assurance  about  the  quality  of  the  records  that  turns  out  to  be 
greatly  over-optimistic,  particularly  when  the  records  are  kept  for  some 
legal  or  administrative  purpose  but  rarely  used  or  examined.  A  careful 
pilot  survey  of  the  records,  designed  to  reveal  any  weaknesses  for  the 
purpose  at  hand,  is  essential  before  commitments  are  made, 

5.  How  will  the  sample  size  or  sizes  be  determined?  In  controlled 
experimentation  there  are  formulas  that  provide  guidance  about  sample  size 
by  calculating  the  size  needed  to  estimate  the  effect  with  a  prescribed  width 
of  95%  confidence  interval,  or  the  size  for  which  some  basir  tosL  of 
significance  will  have  a  prescribed  power.  It  is  advisable  to  try  to  use 
these  formulas  in  non-experimcntal  studies  also.  However,  in  order  to 
obtain  useful  numerical  answers  from  these  formulas  one  must  have  an 
estimate  of  (i)  the  standard  deviation  per  observation  and  (ii)  the  likely 
size  of  the  effect  that  is  being  estimated.  In  exploratory  studies  these 
estimates  may  be  lacking,  and  the  investigator  may  have  to  use  simply  the 
largest  sample  size  that  can  be  afforded,  having  speculated  that  this  size 

is  more  likely  to  be  too  small  than  too  large. 

6.  If  non-response  or  later  melting-nway  of  the  sample  is  anticipated, 
what  are  the  plans  for  coping  with  it?  This  is  a  common  problem,  especially 
when  participation  in  the  study  is  somewhat  of  an  imposition  on  the  subjects, 
or  when  the  study  extends  for  several  years.  Investigators  tend  to  be  lax 
about  non-response.  The  standard  call-back  or  follow-up  questionnaire 
procedures  developed  in  sample  surveys  are  often  surprisingly  helpful, 
Sometimes  it  is  feasible  to  follow  people  who  move  within  the  same  metro¬ 
politan  area  even  if  it  is  too  costly  to  follow  those  who  leave  the  area. 
Sometimes  background  information  about  non-rfBpondents  is  available,  or 
can  be  obtained  by  mail,  that  assists  a  judgement  about  the  extent  to  which 
they  bias  the  conclusions.  Speculations  about  the  extent  to  which  nun- 
respondents  might  bias  the  results  can  always  he  made  much  more 
comfortably  with  a  10%  than  with  a  30%  non-response  rate. 

7.  What  are  the  comparisons  from  which  the  size  of  the  presumed 
causal  effect  will  be  estimated?  Numerous  points  arise  hero.  In  Homo 
studies  the  'cause  present'  group  is  clearly  defined,  but  it  is  less  clear 
what  can  be  used  as  a  'cause  absent'  group  for  comparable  purposes.  Often 
it  is  important  to  estimate  the  causal  effect  separately  in  different  subgroups 
of  the  population  (e.g.  for  people  of  different  ages,  far  man  and  women). 
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The  types  of  adjustment  to  be  made  for  handling  confounded  variables  are 
also  relevant. 

8.  Is  the  environment  a  'typical'  one  from  the  viewpoint  of  generali- 
zability  of  results?  Sometimes  an  ingenious  investigator  finds  a  group  of 
people  (for  instance  a  special  religious  sect)  among  whom  the  causal  force 
is  operating  with  no  important  confounded  variables.  But  he  may  reluctantly 
decide  not  to  attempt  the  study  in  this  group,  because  they  seem  atypical  in 
so  many  respects  that  any  generalization  of  results  would  appear  hazardous. 

With  some  problems  of  great  interest  and  importance,  investigators 
have  to  search  for  a  long  time  before  a  suitable  environment  is  found. 
Sometimes  none  is  found:  in  other  cases  we  are  restricted  to  the  type  of 
study  that  can  be  done  rather  than  the  type  we  would  like  to  do.  Consider 
the  problem  of  investigating  in  human  subjects  the  effects  of  exposure  to 
atomic  radiation  on  illness  and  death  rates.  Ideally,  the  answer  would 
take  the  form  of  a  dosage-response  curve,  the  rate  being  expressed  as  a 
function  of  the  exposure  history  (amount  and  duration). 

As  pointed  out  by  Seltser  and  Sartwell  [2]  ,  the  principal  opportunities 
for  investigations  in  human  subjects  are  confined  to  the  following:  (a)  the 
Japanese  survivors  of  the  atomic  bombs  in  Hiroshima  and  Nagasaki, 
involving  a  single  exposure,  (b)  groups  occupationally  exposed  to  radiation 
at  times  when  the  possible  danger  from  this  source  was  not  realized-- 
radiologists,  dentists,  and  makers  of  watches  with  luminous  dials,  (c) 
persons  who  received  medical  radiation,  as  in  the  treatment  of  some  forms 
of  cancer,  or  infants  exposed  in  utero through  pelvic  X-rays  of  the  mother 
in  the  late  stages  of  pregnancy,  and  (d)  areas  of  the  earth  in  which  natural 
radioactivity  is  unusually  high. 

None  of  these  sources  provides  more  than  limited  material  for  con¬ 
structing  a  dosage-response  curve.  To  illustrate  the  types  of  study  that 
have  been  undertaken,  long-term  studies  in  Hiroshima  and  Nagasaki  were 
initiated  in  1950.  In  Hiroshima  the  sample  contains  about  12,  000  people, 
divided  into  4  groups  of  about  3,  000  each,  according  to  their  distances 
from  the  point  of  impact  of  the  bomb.  The  subjects  receive  regular  health 
examinations,  with  particular  attention  to  any  symptom  that  might  be  an 
after-effect  of  radiation  exposure. 

A  study  of  this  type  is  expensive  and  administratively  difficult. 
Fortunately,  the  health  data  also  permit  many  useful  investigations  of 
general  health  questions.  From  the  viewpoint  of  the  dosage -response 
curve,  a  weakness  \s  that  the  dose  to  which  any  person  was  exposed  is  not 
known,  but  has  had  to  be  estimated  roughly  from  memory  of  a  person's 
location  and  local  shielding  by  buildings  at  the  time  when  the  bomb  fell. 


Also,  the  group  furthest  from  the  epicenter,  who  serve  as  the  non-exposed 
group,  differ  in  some  important  characteristics  from  the  three  exposed 
groups,  and  have  proved  unsatisfactory  as  a  'control'  [3]  . 

The  study  by  Seltser  and  Sartwell  [2]  of  the  mortality  of  radiologists 
is  an  excellent  example  of  the  possibilities  from  groups  occupationally  or 
medically  exposed.  They  chose  male  members  of  the  Radiological  Society 
of  North  America.  For  each  member  they  obtained  by  a  painstaking 
search  the  status  (dead  or  alive)  as  of  December  31,  1958,  with  cause  of 
death  and  any  available  information  on  other  factors  such  as  age  that  might 
influence  duration  of  life.  Research  of  this  type  always  raises  the  question; 
with  what  are  the  exposed  group  to  be  compared?  Ideally,  we  seek  a 
non-exposed  group  which  is  similar  to  the  exposed  group  with  regard  to 
any  other  variable  that  is  known  or  suspected  to  have  a  material  effect  on 
duration  of  life.  (In  this  example  an  obviously  relevant  variable  is  age.  ) 

In  an  observational  study  the  extent  to  which  this  goal  can  be  met  is  of 
course  dependent  on  our  ability  to  measure  such  variables  and  to  find  a 
group  that  has  similar  distributions  with  respect  to  them. 

The  authors  chose  two  comparison  groups.  As  the  nearest  to  a  non- 
exposed  group  they  used  the  American  Academy  of  Ophthalmology  and 
Otolaryngology,  whose  members  rarely  have  occasion  to  employ  X-radia¬ 
tion.  As  an  intermediate  group  they  also  included  the  American  College 
of  Physicians  ,  since  some  of  these  members  use  X-rays,  for  example,  in 
heart  examinations.  In  such  studies  the  inclusion  of  a  middle  group  is 
advantageous  in  either  adding  confirmation  to  the  results  given  by  the  two 
extreme  groups  or  in  casting  doubt  upon  them.  This  study,  however, 
again  has  the  weakness  that  no  measures  of  the  doses  of  radiation  experi¬ 
enced  by  the  subjects  are  available,  except  as  a  rough  guess  for  the  group 
as  a  whole. 

3.  SIMPLE  TYPES  OF  STUDY  PLAN.  This  section  introduces 
some  simpler  types  of  plan,  with  a  brief  discussion  of  their  strengths  and 
weaknesses  and  of  the  statistical  analysis. 

3.  1  A  single  group,  measured  before  and  after  the  action  ol  the 
causal  agent.  This  type  is  common  when  the  causal  agent  is  of  short: 
duration.  For  example,  after  complaints  about  the  time  taken  to  go 
through  a  cafeteria  line,  a  change  in  the  service  is  proposed  that  it  is 
claimed  will  remove  the  bottleneck.  Before  this  change  is  made,  the 
times  taken  to  go  through  are  recorded  for  a  random  sample  of  the  usei  s , 
the  same  being  done  after  the  change  is  made.  In  other  situations,  the 
causal  agent  might  be  DDT  spraying  of  10  villages,  an  estimate  of  the 
misquito  population  being  made  before  and  after  spraying,  or  a  radio  and 
TV  appeal  which  the  stations  in  an  area  agree  to  give  on  a  certain  day, 
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urging  mothers  to  bring  their  children  into  the  clinics  in  a  city  for 
immunizations,  the  number  of  children  appearing  for  immunization  being 
counted  in  each  clinic  during  the  weeK  beiore  and  the  week  after  this  appeal. 

Unlike  the  radiologists  example,  such  studies  have  no  comparison 
group,  usually  because  all  members  of  the  population  of  interest  are 
exposed  (at  least  potentially)  to  the  causal  agent.  Sometimes,  as  in  the 
DDT  example,  a  comparison  group  of  unsprayed  villages  might  have  been 
chosen,  but  is  excluded  for  administrative  or  financial  reasons.  Often,  a 
single-group  study  is  the  only  feasible  approach  in  attempting  to  learn 
something  about  the  effects  of  new  governmental  programs  or  laws  that 
apply  to  everyone. 

The  absence  of  a  comparison  group  is,  of  course,  the  major  weakness. 
Any  other  event  that  produces  a  change  in  the  level  of  the  variable  during 
the  Before -After  period  has  its  effects  inevitably  confounded  with  those  of 
the  causal  agent.  Campbell  and  Stanley  [4]  give  a  detailed  catalogue  of 
these  sources  of  bias  in  educational  research.  If  the  investigator  is  aware 
of  such  other  influences  he  can  sometimes  ask  questions  about  the  reasons 
for  people 1  s  change  in  behavior  that  help  him  to  judge  whether  these 
influences  have  been  important.  Knowledge  that  a  change  is  coming  may 
influence  people's  behavior  immediately  before  the  change,  so  that  the 
After-Before  difference  is  misleading, 

Although  the  conclusions  from  studies  of  this  type  involve  a  substantial 
element  of  judgment,  the  studies  are,  as  Campbell  and  Stanley  {Jut  it, 

"worth  doing  when  nothing  better  can  be  done",  I  might  express  it  a  little 
more  positively.  With  new  public  programs,  plana  to  estimate  their  effects 
are  often  not  initiated  until  some  time  after  the  program  has  been  running. 

By  this  time  it  is  difficult  to  get  good  'Before'  measurements  and  too  late 
to  take  precautions  or  gather  supplementary  information  that  might  have 
helped  in  judging  the  effects.  The  question:  How  can  we  study  the  effects 
of  this  program?  should  be  raised  some  time  before  the  program  begins. 

The  statistical  analysis  usually  involves  examining  the  difference 
between  two  paired  or  independent  samples.  The  samples  may  be  sub- 
classified  by  another  variable,  e.g.  ,  age  of  subject,  in  order  to  reveal 
any  variation  in  effect  with  age. 

Sometimes  there  is  reason  to  expect  that  the  Before  measurement  will 
itself  influence  the  subject's  behavior.  A  plan  that  has  been  proposed  is 
to  have  two  groups,  both  exposed  to  the  cause.  Whenever  feasible,  these 
can  be  random  halves  of  an  initially  chosen  group.  Group  l  is  measured 
'Eefore'  and  'After',  group  2  is  measured  'After'  only.  The  idea  is  that 
by  comparing  the  two  'After'  sets  of  results,  we  can  test  whether  the  'Before' 
measurement  influenced  the  level  of  the  'After'  responses  in  group  1. 


The  beat  method  of  estimating  the  size  of  the  causal  effect  presents 
a  problem  involving  the  pooling  of  data  after  performing  a  test  of  signifi¬ 
cance.  If  the  subscripts  a  and  b  denote  'After'  and  'Before1,  the  difference 
(Y_  -Y.,  )  is  an  unbiased  estimate  of  the  causal  effect.  Assuming  a 

C*  t,  j 

constant  variance_cr  _ger  subject,  this  difference  has  variance  2 <r  /n. 

The  difference  (Y,  -Y,,  )  has  variance  2<r^(l-p)/n,  where  p  is  the 
*  la  lb  '  r  ' 

correlation  between  the  'Before1  and  'After'  measurements  for  the  same 

subject,  but  is  unbiased  only  if  the  'Before'  measurement  did  not  affect 

the  level  of  the  'After'  measurement.  The  estimates  (Y.  -Y.,  )  and 

2a  lb 

(Y.  -Y,,)  are  themselves  correlated,  since  Y,,  appears  in  both.  One 
*  la  lb  lb 

approach  is  to  seek  a  weighted  mean  of  these  estimates,  with  weights 
determined  from  the  results  of  the  preliminary  test  of  significance  of 

(Y  -Y  ),  that  has  minimum  mean  square  error  subject  to  a  condition 

la  2  a 

that  the  bias  be  kept  small. 

The  preceding  discussion  has  been  confined  to  studies  in  which  it  is 
satisfactory  to  measure  the  causal  effect  at  a  single  time  after  the  causal 
event.  In  many  situations,  the  causal  event  may  ha  ■  prolonged  effects, 
or  if  its  effect  is  likely  to  die  away,  the  investigator  wants  to  measure  this 
decay  curve.  For  these  purposes  we  need,  at  a  minimum,  a  series  of 
measurements  at  intervals  of  time  before  the  event,  followed  by  a  series 
at  intervals  after  the  event.  The  problem  of  the  model  to  be  used  for  the 
analysis  of  results  of  this  type  raises  some  interesting  questions  which 
have  been  illustrated  by  Campbell  and  Stanley  [4]  .  Model -fitting  and 
interpretation  are  easiest  when  the  'Before'  measurements  appear  to 
fluctuate  about  a  constant  level;  the  difficulty  increases  when  the  'Before' 
and  'After'  measurements  display  trends ,  particularly  those  with  curvature. 
The  question  of  serial  correlations  must  also  be  considered. 


3.  2  'Cause  present'  and  'Cause  absent1  groups.  Y  measured  'Alter' 
only.  This  is  a  very  common  type.  The  Hiroshima  and  radiologist  studies, 
investigations  of  the  effectiveness  of  seat  belts  in  preventing  injury  in 
automobile  accidents,  and  the  large  studies  of  the  death  rates  ol  non- 
smokers  and  cigarette,  cigar,  and  pipe  smokers  are  fexamples.  As  we 
have  seen,  there  may  be  several  'cause  present1  groups,  representing 
different  strengths  or  variations  inthe  causal  agent,  and  more  than  one 
'cause  absent'  group,  particularly  where  the  selection  of  a  control  group 
presents  difficulty. 


At  its  simplest,  the  analysis  follow  the  usual  methods  for  the  analysis 
of  one-way  classifications  or  of  two-way  classifications  if  pairing  or  block¬ 
ing  has  been  employed  in  forming  the  groups.  Often,  however,  the  analysis 
of  a  multiple  classification  is  involved,  other  variables  being  introduced 
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in  order  to  diminish  the  risk  of  bias,  as  discussed  in  section  4,  or  because 
the  investigator  wants  to  examine  interactions  of  the  causal  effects  with 
these  variables. 

An  important  variant  of  this  method,  often  called  the  retrospective 
method,  is  much  used  in  epidemiological  research.  In  this,  we  find  a 
group  in  which  the  effect  is  present  and  one  from  which  it  is  absent,  and 
compare  the  frequency  with  which  the  presumed  causal  agent  is  found  in 
the  two  groups.  This  approach  is  natural  when  a  group  of  people  show 
symptoms  of  food  poisoning  at  a  picnic  and  the  cause  is  being  sought.  As 
another  example,  numerous  investigators  have  selected  a  group  of  lung 
cancer  patients  and  another  group  of  patients  in  the  9ame  hospitals  who  do 
not  have  this  disease,  comparing  the  proportions  of  cigarette  smokers  in 
the  two  groups.  With  this  approach,  it  is  often  hard  to  select  the  'effect 
absent'  group  and  to  obtain  measurements  of  high  quality.  Further, 
erroneous  results  may  be  obtained  when  there  are  several  causal  agents 
and  attention  is  focussed  on  one.  But  with  an  effect  that  is  rare,  this 
approach  may  be  the  only  practicaole  one,  and  it  is  often  the  quickest  way 
of  obtaining  a  preliminary  indication  for  or  against  a  postulated  relationship, 
For  a  discussion,  see  [?>]  . 

3.  3  'Cause  present1  and  'cause  absent1  groups,  Y  measured  Before 
and  After.  This  plan  has  been  used,  for  example,  in  studies  of  the  effects 
of  new  public  housing,  as  against  slum  housing,  on  health  and  social 
behavior.  When  it  became  known  which  group  of  applicants  were  to  move 
into  a  new  public  housing  development,  a  control  group  of  families  who 
would  in  general  remain  in  slum  housing  were  selected.  The  basic  question- 
naries  on  health  and  social  behavior  were  obtained  both  before  the  move 
took  place  and  at  several  times  after  the  successful  applicants  had  moved. 

In  a  study  of  the  effects  of  fluoridation  of  town  water  on  children's  teeth, 
usually  done  by  a  plan  of  type  3.  1,  a  nearby  control  town  which  did  not  plan 
to  fluoridate  could  be  included  if  the  resources  permitted.  The  state  of 
dental  health  of  a  sample  of  children  from  both  towns  would  be  measured 
before  and  some  time  after  the  fluoridation  in  the  first  town. 

With  this  plan  the  investigator  is  in  a  better  position  to  guard  against 
bias  than  with  plan  3.  2.  Ideally,  the  initial  distribution  of  the  response 
variable  Y  should  be  the  same  in  the  'cause  present'  and  'cause  absent1 
groups.  Since  he  has  the  initial  measurements,  he  can  verify  whether  this 
seems  to  be  the  case.  Even  if  the  distributions  are  somewhat  different,  it 
is  still  possible  to  compare  the  amount  of  change  in  the  two  groups  during 
the  'Before-After'  period. 

A  general  estimate  of  the  size  of  the  causal  effect  is 

(W  '  e!V72b>  • 
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where  the  value  of  j3  is  to  be  chosen.  Suppose  that  the  model  is  as  follows 
Afkfnrp'  v  s  it  4-  V  “  JJ,  !  c 

'lbj  *  1  “  1  b  j  ’  2bj  r2  2bj 


After:  y.  =  p.  +  6  +  t.  +  t»  . ;  y,  .  =  P,  +  T,  +  e,  , 

'laj  1  1  laj  72aj  n2  2  2aj 


Here,  6  represents  the  causal  effect  to  be  estimated;  and  t0  represent 

other  time-changes  that  affect  the  two  groups;  and  the  e's  are  random 
variables  with  means  zero.  From  this  model  we  see  that 


l  <Yla-Y2a> 


-iJ<VY2b>> 


S  +  (VT  *  ( 1  -3) 


Hence,  (i)  if  t  ^  /  t  ,  the  plan  provides  no  unbiased  estimate  of  6; 

this  is,  of  course,  obvious,  (ii)  ifr^  =  T2  but  H-j  /  P^  (*•  e*  *  ^,e  initial 

levels  of  the  two  groups  differ),  the  only  unbiased  estimate  of  6  is  given  by 
taking  (3=1.  (iii)  if  Tj  =  r  and  p ^  =  p^  >  anY  value  of  j3  gives  an  unbiased 

estimate,  Assuming  that  the  e's  all  have  the  same  variance  <r2 ,  the  estimate 
(3.  1)  has  variance 

2«r2  ( 1  - 2.3  p+(32)/n 


where  p  is  the  correlation  coefficient  between  an  'After'  and  a  'Before' 
measurement.  Ii  the  'After'  and  'Before'  samples  are  independent,  so  that 
p  =  0,  we  take  ,3  =  0.  If  these  measurements  are  paired,  the  minimum 
variance  is  given  by  3  -  p.  In  practice,  ;3  is  estimated  in  this  case  by  an 
analysis  of  covariance  of  the  'After'  on  the  'Before'  measurements. 

4.  TECHNIQUES  FOR  INCREASING  PRECISION  AND  ELIMINATING 
BIAS.  In  controlled  experiments  the  investigator  relies  on  randomization, 
plus  other  precautions  such  as  'blindness'  in  the  measurement  process,  to 
ensure  that  biases  are  kept  to  a  negligible  level.  As  means  of  increasing 
precision,  blocking  and  adjustments  made  by  the  analysis  of  covariance  are 
two  of  the  principal  weapons, 

Devices  analogous  to  blocking  and  covariance  are  commonly  used  in 
non-experimental  studies  also.  However,  since  randomization  is  not 
available,  these  devices  must  perform  the  double  function  of  eliminating 
bias  and  of  increasing  precision.  In  fact,  since  bias  is  regarded  as  the 
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chief  source  of  erroneous  conclusions,  control  of  bias  becomes  their 
principal  function. 

RiippriQo  that  Y  is  the  response  or  effect  va.xia.Lue,  rind  itiai  there  is  a 
'cause  present'  and  a  'cause  absent'  group.  If  X  is  any  variable  that  is 
related  to  Y,  a  bias  may  arise  in  (Y^-Y^)*  the  estimated  difference  between 

the  means  of  the  two  groups,  if  the  distribution  of  X  differs  in  the  two  groups. 
For  instance,  if  the  regression  of  Y  on  X  is  linear, 


Y..  =  p.  +  j3X. .  +  e.., 
ij  i  iJ  ij 

where  i  =  1,  2  denotes  the  group,  and  the  e,,  are  residuals  with  mean  zero, 
then  ^ 

(4.1)  E(YrY2|X)  =  ^  -  p2  +  ;3(X1-X2)  . 

The  term  ^(X^-X.,)  is  the  bias. 

In  handling  these  variables  the  investigator  makes  a  list  of  the  X 
variables  known  or  thought  to  be  related  to  Y.  These  variables  are  placed 
in  one  of  the  following  classes. 

(I)  Important  variables  whose  effects  the  investigator  will  try  to 
remove,  either  because  there  seems  a  danger  of  bias  or  because  removal 
will  bring  a  worthwhile  increase  in  precision. 

(II)  Variables  for  which  the  investigator  will  check  whether  their 
distribution  is  similar  in  the  'cause  present1  and  'cause  absent1  groups. 

No  adjustment  will  be  made  for  these  variables  unless  the  distributions 
appear  sufficiently  different  so  that  there  seems  a  danger  of  bias.  This 
method  is  employed  for  variables  whose  correlation  with  Y  is  modest.  If 
Y  and  X  are  linearly  related,  with  correlation  p,  the  fractional,  reduction  in 
the  variance  of  Y  due  to  elimination  of  the  effect  of  X  cannot  exceed  p^. 

If  f  p  (  <  0.  3,  this  reduction  is  less  than  9%:  the  potential  increase  in 
precision  is  small, 


In  practice,  verification  that  the  distribution  of  X  is  similar  in  the  two 
groups  of  subjects  is  often  done  by  forming  the  frequency  distribution  of  X 
in  each  group,  with,  say,  k  classes,  and  making  the  y?  test  for  a  2  x  k 
contingency  table.  A  low  value  of  x^  is  taken  as  assurance  that  the  distri¬ 
butions  of  X  are  similar  and  that  there  is  little  risk  of  bias  from  the  relation 
between  Y  and  X.  This  test  may  not  be  the  best  procedure.  If  the 
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regression  of  Y  on  X  is  linear,  equation  (4.  1)  shows  that  comparison  of  the 
mean  values  of  X  in  the  two  groups  ^s  more  relevant,  since  the  bias  in 
(Y^-Y^)  comes  from  the  term  (Xt -X.,).  Similarly,  if  the  relation  between 

Y  and  X  is  curved  and  can  be  approximated  by  a  quadratic  regression, 
comparison  of  the  first  two  moments  of  X  in  the  two  groups  is  relevant. 

(Ill)  Variables  about  which  nothing  will  be  done,  because  their  rela¬ 
tion  to  Y  is  judged  too  tenuous  to  create  trouble.  This  class  also  contains 
X  variables  which  it  is  not  feasible  to  measure  and  those  of  which  the 
investigator  is  ignorant. 

A  natural  question  at  this  point  is:  Why  not  put  all  the  X  variables  in 
class  I,  or  at  least  do  so  whenever  there  is  any  doubt?  I  don't  know  the 
full  answer  to  this,  but  a  partial  answer  is  that  the  techniques  (matching 
and  adjustment)  by  which  we  attempt  to  remove  the  effects  of  these  X 
variables  become  steadily  more  cumbersome  to  apply  and  to  interpret  as 
the  number  of  X  variables  increases.  These  techniques  may  be  described 
as  follows. 

'Ideal'  matching,  Each  member  of  group  1  has  a  partner  in  group  2 
who  has,  within  narrow  limits,  the  same  value  for  any  X  variable  for  which 
adjustment  is  being  made.  By  taking  the  difference  between  partners,  the 
effects  of  these  X  variables  are  eliminated,  provided  that  the  regression  of 

Y  on  these  X  variables  is  the  same  in  both  groups.  Clearly,  this  matching 
is  effective  whether  the  regression  is  linear  or  curved, 

In  practice,  the  construction  of  matched  pairs  often  presents  difficulty, 
particularly  if  matching  has  to  be  done  on  several  X  variables,  Usually, 
it  is  necessary  to  have  a  large  reservoir  of  subjects  for  at  least  one  of  the 
two  groups;  otherwise,  it  will  not  be  possible  to  locate  partners  who  agree 
closely  on  the  values  of  all  the  desired  X  variables.  A  common  experience 
is  that  the  construction  of  partners  takes  much  longer  than  anticipated,  that 
the  rules  set  up  about  the  closeness  of  the  match  have  to  be  continually 
relaxed,  and  that  some  subjects  have  to  be  omitted  because  no  match  is 
found. 

Stratified  or  frequency  matching.  This  is  a  looser  form  of  matching 
which  facilitates  the  construction  of  partners.  The  range  of  each  X  variable 
is  divided  into  a  number  of  classes,  commonly  from  2  to  5  or  6.  Thus  the 
X  variables  c, reate  a  multiple  classification:  for  instance,  with  3  X's  and 
4  classes  per  variable  there  are  64  cells,  For  a  member  of  the  'cause 
present'  group,  any  member  of  the  'cause  absent'  group  who  falls  in  the 
same  cell  is  an  acceptable  partner.  In  the  end,  what  this  method  amounts 
to  is  that  in  any  cell  of  the  multiple  classification  the  two  groups  have  an 
equal  number  of  subjects.  Often,  there  is  no  specific  designation  of 
partners,  since  this  seems  rather  pointless. 


Stratified  matching  is  the  only  kind  that  is  feasible  for  an  X  that  is  an 
ordered  classification,  such  as  "mild",  "moderate",  "severe"  or  is 
qualitative,  e.g.  ,  religious  affiliation  or  urban,  suburban,  rural. 

Adjustment  by  subclassification.  This  method  is  very  similar  to 
stratified  matching.  When  selecting  the  'cause  present'  and  the  'cause 
absent1  groups  we  do  not  attempt  any  matching.  Adjustment  for  differences 
in  the  X  distributions  in  the  two  groups  is  accomplished  by  forming  the 
multiple  classification  used  in  stratified  matching  and  making  adjustments 
by  a  least  squares  or  analysis  of  variance  model. 

To  illustrate  the  relation  between  the  two  methods,  suppose  there  are 
X  variables  and  that  only  100  subjects  are  available  for  the  'cause  present' 
group.  To  see  how  the  land  lies,  we  classify  these  subjects,  plus  100  from 
the  'cause  absent'  group,  into  9  cells,  assuming  that  each  X  variable  has 
3  cells.  In  table  1,  the  numbers  of  subjects  found  in  each  cell  are  shown, 

P  and  A  denoting  the  two  groups,  Both  the  P  and  A  sets  add  to  100. 

TABLE  1 


Subclassification  on  two  X  variables. 


If  we  are  using  stratified  matching,  we  select  8  at  random  out  of  the 
2  3  A's  in  the  top  left  cell,  discarding  the  rest.  In  both  the  other  cells  in 
the  top  row,  we  need  more  A's  to  reach  the  desired  numbers  10  and  19. 
Looking  the  table  over,  it  appears  that  a  reservoir  of  perhaps  700  or  more 
subjects  suitable  for  the  'cause  absent'  group  would  be  necessary  to  build 
up  all  the  cells  to  the  desired  numbers  in  the  P  group. 
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In  adjustment  by  subclassification,  as  I  am  using  this  term,  we  either 
accept  the  A  sample  as  it  stands  or  attempt  only  to  build  up  cells  in  which 
the  A  sample  is  very  small.  The  decision  depends  on  the  size  of  the 
reservoir  for  tne  A  group,  the  time  and  trouble  involved  in  any  build  up, 
and  the  investigator's  opinion  as  to  whether  the  effort  is  worthwhile. 


From  the  viewpoint  of  estimation  of  effects  we  face  a  2  x  3  x  3  table 
with  either  stratified  matching  or  adjustment  by  subclassification.  It  is 
assumed  that  Columns  (X^)  and  Rows  (X^)  both  show  real  effects,  and 

possibly  an  interaction,  since  otherwise  there  would  be  no  need  to  match 
or  adjust  for  these  X  variables. 


The  simplest  situation  is  that  in  which  there  is  no  interaction  of  the 
(P-A)  difference  with  either  or  X^.  In  this  event  the  9  differences 

(P..-A,,)  are  all  estimates  of  the  same  quantity.  It  follows  that  with 
ij  *J 

stratified  matching,  the  difference  between  the  overall  sample  means 

(P-A)  is  free  from  any  confounding  with  the  levels  of  X  or  X  .  The 
—  —  2  2  12 
estimate  (P-A)  has  variance  <r  /50,  where  <r  is  the  within-cell  variance 
(assumed  constant  from  cell  to  cell).  If  the  A  sample  is  accepted  as  it 
stands,  the  corresponding  estimate  for  adjustment  by  subclassificatic  i  is 
a  weighted  mean  of  the  differences  (P^-A^),  weighting  each  inversely  as 

The  weights  are  n, ,  n,, .  ./(n. .  +n_ , ,) ,  where  the  n's  are  the 
lij  21/ v  lxj  2ij  - 


its  variance. 


sample  sizes  in  the  (i,j)  cell.  For  table  1  the  variance  of  this  weighted 
mean  difference  turns  out  to  be  o*2/ 3 6 .  6,  about  35%  larger  than  with 
stratified  matching.  In  this  situation  stratified  matching  provides  a 
simpler  estimate  that  is  more  precise. 


We  may,  however,  wish  to  examine  whether  the  (P-A)  difference  changes 
with  the  level  of  X^  and  As  Billewicz  (6]  has  pointed  out,  the  ability 

to  examine  these  interactions  is  an  advantage  which  thene  methods  hold  over 
'Ideal'  matching.  If  interactions  are  found,  estimation  of  the  overall 
difference  may  become  of  little  interest.  The  technique  needed  here  is  the 
analysis  of  multiple  classifications  with  unequal  numbers  in  the  cells. 

While  the  general  least  squares  theory  covering  this  technique  is  not  new, 
much  remains  to  be  learned  about  the  practical  handling  and  interpretation 
of  such  analyses,  particularly  for  investigators  who  are  not  expert  in 
statistical  methods.  The  recent  paper  by  Federer  and  Zelcn  [7]  is  a  useful 
contr  ibution. 


Adjustment  by  covariance.  Conceptually,  this  is  the  same  approach 
as  adjustment  by  subclassification  for  the  case  in  which  the  X's  are 
continuous.  Covariance  may  have  an  advantage  and  a  disadvantage.  The 


3  32 


grouping  of  continuous  X's  into  classes  in  adjustment  by  subclassification 
loses  some  information:  covariance  avoids  this  loss.  On  the  other  hand, 
adjustment  by  subclassification  does  not  involve  any  assumption  that  the 
relation  between  Y  and  X  is  linear.  If  the  investigator  follows  the  common 
practice  of  adjusting  in  covariance  only  for  linear  effects  of  X,  covariance 
is  at  a  disadvantage  if  the  true  regression  has  substantial  non-linearity. 

Of  course,  this  loss  can  be  avoided  by  adopting  a  more  accurate  model  in 
the  covariance  analysis. 

How  effective  are  these  techniques?  The  following  comments  are 
based  on  results  quoted  in  f8]  and  on  some  unpublished  work.  As  already 
mentioned,  'ideal1  matching  removes  bias  due  to  X^.  .  .  under  any 
regression 

Y..  =  |i,  +  <j>(X . X  )  +  e , , ,  (i  *  1,2) 

lj  i  *  hi  kij  ij 


if  the  regression  function  <{>  is  the  same  in  both  groups,  The  variance 
of  (Yj-Yg)  is  reduced  by  the  matching  to  a  fraction  (l-p^)  of  its  original 

value,  where  p  is  the  correlation  coefficient  between  Y  and  tf>  •  I*1  practice) 

'ideal'  matching  is  likely  to  be  at  its  best  when  the  X's  are  quantitative  and 
one  of  the  groups  has  a  large  reservoir  in  which  matches  may  be  sought, 
while  the  other  group  is  small.  In  this  situation,  matching  should  not 
prove  too  difficult.  Moreover,  the  other  disadvantage  of  matching --that 
one  cannot  examine  effectively  the  interactions  of  the  causal  variable  with 
the  X  variables--scarcely  applies  when  one  group  is  small,  since  the  sample 
size  would  probably  preclude  any  precise  estimates  of  interactions. 

Covariance  adjustment  should  have  about  the  same  effects  on  bias  and 
precision,  with  the  qualifications  that  the  correct  form  of  the  regression 
equation  must  be  fitted,  and  that  there  is  some  loss  of  precision  from 
sampling  errors  in  the  estimated  regression  coefficients.  If  the  regression 
is  linear  and  the  re  happens  to  be  no  bias  due  to  the  X's,  the  fraction  to 
which  V^^-Y^)  is  reduced  by  the  uovarlaru:  e  adjustment  is  roughly 


' 

1  +  _ _ £ _ , 

(2n-k  ■  3) 

where  n  is  the  size  of  sample  in  each  group,  so  that  the  regression 
coefficients  are  estimated  from  2(n-l)  degrees  of  freedom.  The  term  in 
curly  brackets  will  be  close  to  1  if  k  is  small  relative  to  2n.  However,  if 
there  are  substantial  biases  in  some  of  the  X's,  (4.2)  no  longer  applies, 
and  the  corresponding  term  in  curly  brackets  can  be  much  larger.  The 


(4.2)  (1-P2) 
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performance  of  this  covariance  adjustment  when  the  fitted  model  is  of 
the  wrong  form  deserves  further  study.  Linear  covariance  adjustments 
seem  to  periorm  surprisingly  well  when  the  true  regression  has  a  moderate 
degree  of  curvature. 


The  preceding  remarks  about  matching  and  covariance  assume  that  the 
X's  are  measured  without  appreciable  error.  Suppose  that  for  an  X 
variable  the  recorded  measurement  is  x  *  X  +  d,  where  £  is  a  random 
error  of  measurement  with  mean  zero,  independent  of  X  and  of  £,  the 
deviation  of  Y  from  its  regression  on  X,  The  effects  of  these  errors  of 
measurement  are  roughly  as  follows,  where  f  =  tr^/<r^  =  cr^/(crx  + 

(i)  Matching  and  covariance  remove  only  a  fraction  (l-f)  of  the 
bias  in  Y  due  to  X. 

2 

(ii)  V(¥^-¥ )  is  reduced  to  the  fraction  {1  -  (l-f)  p  }  of  its 
original  value. 


While  imprecise  measurement  weakens  the  performance  of  these 
techniques,  it  is  easy  to  form  an  exaggerated  notion  of  the  size  of  this 
effect  if  some  check  calculations  are  not  made.  For  instance,  suppose 
that  =  25,  nearly  all  the  correct  values  of  X  lying  between  0  and  125. 


If  we  are  told  that  half  the  observed  measurements  are  wrong  by  more 
than  5  units,  this  seems  rather  poor  quality  of  measurement.  However, 
a  probable  error  of  5  corresponds  to  c'^  a  7.4,  tr^  =  55,  =  680,  and 

f  =  0.  08.  Thus,  92%  of  the  bias  is  still  removed. 


Now  consider  stratified  matching  and  adjustment  by  subclassification 
as  applied  to  quantitative  X's.  From  the  viewpoint  of  errors  of  measure¬ 
ment  of  X  ,  these  methods  appear  crude,  since  the  quantitative  scale  of  an 
X  variable  is  replaced  by  a  classified  variable  that  takes  only  the  number 
of  distinct  values  that  the  number  of  classes  allow.  With  stratified  match¬ 
ing  the  values  of  (L-f)  are  0.  64,  0.79,  0,86,  0.90,  and  0,  92  lor  l ,  J ,  4 ,  5 , 
and  6  classes,  respectively.  Strictly,  these  values  hold  only  il'  the  regres¬ 
sion  of  Y  on  X  is  linear,  X  is  normally  distributed,  and  the  classes  are  of 
equal  size,  However,  they  appear  accurate  enough  as  guides  to  practice 
when  the  regression  of  Y  on  X  is  nonlinear,  when  X  has  some  skewness  and 
kurtosis  and  when  the  class  sizes  depart  moderately  from  equality.  The 
results  indicate  that  at  least  five  or  six  classes  should  be  used  for  any  X 
variable  which  is  thought  to  be  a  source  of  substantial  bias. 


With  adjustment  by  subclassification  the  preceding  (l-lj  values  apply 
so  far  as  the  removal  of  bias  due  to  X  is  concerned.  This  method  suffers 
an  additional  loss  of  precision,  as  illustrated  previously,  because  of 


inequalities  in  the  sample  sizes  of  the  two  groups  in  the  individual  cells  of 
the  multiple  classification. 

The  situation  when2£  is  an  ordered  classification  is  not  so  clear.  If 
an  ordered  classification  can  be  regarded  as  essentially  a  grouping  of  an 
underlying  quantitative  X,  the  preceding  values  of  (l-f)  should  be  applicable. 
In  practice,  however,  ordered  classifications  are  often  used  because  no 
more  precise  method  of  measurement  is  known.  If  we  envisage  some 
accurate  measurement  X,  not  yet  discovered,  it  seems  reasonable  that 
the  ordered  classification  will  contain  errors  of  misciassification  as  well 
as  grouping  errors.  These  additional  errors  presumably  reduce  the  values 
of  (l-f),  to  an  extent  that  does  not  seem  to  have  been  investigated. 

Finally,  none  of  the  methods  can  guarantee  to  remove  bias  due  to  an 
X  variable  that  has  been  omitted  from  the  matching  or  adjustments.  The 
situation  with  regard  to  such  omitted  variables  is  interesting.  If  they 
happen  to  have  a  high  correlation  with  the  included  X's--in  other  words,  if 
we  are  lucky- -most  of  their  bias  will  also  be  removed  by  the  matching  or 
adjustments.  This  explains,  I  think,  why  linear  covariance  often  works 
well  when  Y  has  a  quadratic  regression  on  X,  since  X  and  X^  have  a  high 
correlation  in  many  bodies  of  data.  But  one  can  also  meet  the  opposite 
situation  in  which  the  bias  due  to  omitted  X's  is  inflated  by  the  adjustments. 
Thus  in  non-experimental  studies  there  always  remains  an  element  of 
uncertainty  in  our  claims  about  the  size  and  reality  of  a  presumed  cauBal 
effect. 
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A  MODERATELY  DISTRIBUTION  FREE  APPROACH  TO 
RELIABILITY  ESTIMATION  BASED  ON  THE 
FIRST  ORDER  STAli&llC  ' 

Michael  G.  Billings 

C-E-I-R,  Inc.  ,  Dugway  Field  Operation 
Dugway,  Utah 

ABSTRACT.  This  paper  describes  a  small  sample  reliability  test 
design  and  evaluation  technique  based  on  properties  of  the  first  order 
statistic.  The  technique  is  "moderately  distribution  free"  in  that  it  is 
applicable  to  any  problem  which  satisfies  the  following  conditions:  1)  The 
random  variable  X  involved  is  continuous;  2)  X  can  take  on  only  non¬ 
negative  real  values:  3)  the  "mission"  of  the  system  under  investigation 
is  a  set  of  real  numbers  of  the  form  [T,  »),  where  T  >  0,  4)  there  exists 
a  set  of  real  numbers  MC  [  1 ,  »)  such  that  m  «  M  =  >  F  (mT)  >  mFv(T), 

where  is  the  distribution  function  of  X.  Some  sufficient  conditions 
are  given  which  define  classes  of  distributions  to  which  the  technique  is 
applicable.  Also,  it  is  shown  that  the  technique  is  a  highly  accurate 
approximate  procedure  for  reliability  evaluation  whm  in  fact  the  random 
variable  X  involved  has  an  exponential  distribution,  s-t  that  Condition  4  is 
not  satisfied.  Finally,  a  brief  consideration  of  the  Weibull  distribution  is 
presented. 

1.  INTRODUCTION.  The  purpose  of  this  paper  is  to  derive  and 
demonstrate  a  small  sample  reliability  test  design  and  evaluation  technique 
which  appears  to  have  applicability  over  a  wide  class  of  distributional 
forms.  The  technique  derived,  referred  to  as  the  Modified  Distribution 
Free  (MDF)  technique,  is  based  upon  certain  properties  of  order  statistics 
and  is  conceptually  similar  to  the  strictly  distribution  free  binomial  approach 
to  reliability  evaluation.  The  MDF  technique  introduces  certain  fairly 
nonrestrictive  assumptions  in  order  to  achieve  a  trade  off  between  sample 
size  and  system  performance .  Before  proceeding  it  will  be  useful  to 
introduce  and  interpret  the  concepts  and  symbols  which  will  be  encountered 
in  the  ensuing  discussion. 

Technically,  the  term  reliability  is  always  used  relative  to  some 
system,  conceptual  or  real,  the  primary  purpose  of  which  has  been  deter¬ 
mined  to  be  the  accomplishment  of  a  specific  objective  called  the  system 
mission.  The  reliability  of  the  system  is  defined  to  be  the  probability  that 
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the  system  will  accomplish  its  designated  mission.  In  order  to  meaning¬ 
fully  discuss  system  mission  reliability  it  is  necessary  to  establish  a 
method  for  measuring  system  performance  relative  to  the  particular 

mi  R  flinn .  For  this  nurnnsp  it  i  «i  rr.hupni#mt-  tn  ?.;?urr.2  the  existence  cf  C 

population  of  systems  of  the  type  under  consideration.  On  this  popula¬ 
tion  a  random  variable  X  is  defined  in  such  a  way  that  the  mission  can  be 
characterized  as  a  subset  T*  of  the  probability  space  J ^  induced  from 
yj  by  X.  If  the  probability  measure  on  „  is  P  and  the  associated  dis¬ 
tribution  function  of  X  on  jr'  is  F„,  then  the  definition  of  system  mission 
reliability  becomes 

*  T*)  =  \  dF 

•Jji  .it 


P  {*«  J*xf*«T*}  *  Pr(X 


In  practice  it  is  desired  to  obtain  an  interval  estimate  of  the  system 
reliability  to  which  we  are  able  to  attach  a  measure  of  assurance  that  the 
interval  contains  the  true  reliability.  Conventionally,  this  has  been  done 
as  follows: 

1)  One  obtains  for  \  dF  a  confidence  interval  estimator 

”  T*  a  X 

which  depends  on  an  estimator  F  of  F^; 

2)  A  value  of  £  is  then  observed,  and  the  corresponding 

confidence  interval  estimate  for  \  dFv  is  calculated; 

J  T*  X 

3)  The  confidence  coefficient  associated  with  the  interval  estimator 

for  \  dF  is  taken  to  be  the  measure  of  assurance  (confidence)  that  the 
'J  ij>  >;<  " 

calculated  interval  estimate  contains  the  true  value  of 


The  result  of  this  procedure  is  a  statement  of  the  form  "with  y- 
confidence  the  reliability  is  at  least  a",  hereafter  abbreviated  r(a,y), 
where  a  is  the  lower  bound  of  the  interval  estimate  obtained  for 


r. 


T* 


dFv,  and  y  is  the  associated  confidence  coefficient. 


accordance  with  convention,  if  X  is  a  continuous  random  variable  with 

x.  -If  X  is 


density  function  £, v  «  (dF  )/(dx),  then  (  dFv  =  (  L-d 
a  X  a  A 

discrete,  then  the  integral  i  dFv  is  a  sum  over  the  set  T*. 

J  IT.,.,  X 
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Under  certain  assumptions  on  the  random  variable  involved,  it  is 
possible  t~  equivalently  formulate  the  reliability  evaluation  problem 
within  a  hypothesis  test  framework.  The  Modified  Distribution  Free 

technique  described  in  GccLion  2  utilizes  this  approach. 

2,  THE  MODIFIED  DISTRIBUTION  FREF.  (MDF)  TECHNIQUE. 

The  MDF  approach  to  reliability  estimation  presupposes  that  the  follow¬ 
ing  conditions  are  satisfied  by  the  particular  problem  involved: 

1)  The  random  variable  X  under  consideration  is  continuous. 

2)  Jx  -  {xj  x  >  0  }  . 

3)  The  mission  T*  can  be  described  by  T*  =  {x«  J  lx>T,  Tij^  }. 

A  A 

4)  There  exists  a  set  of  real  numbers  MC[l,»)  such  that 

m«M  =5»  F(mT)  >  mF(T) , 


where  T  is  given  in  Condition  3,  and  F(x)  =  Pr{X  <  x). 

The  particular  hypothesis  test  structure  employed  in  the  MDF 
approach  is  described  as  follows;  Suppose  it  can  be  assumed  that  m*M 
(see  Condition  4  above),  and  that  it  is  desired  to  either  conclude  or  fail 
to  conclude  the  reliability  statement  r(a,y)  on  the  basis  of  a  sample  of 
size  n  from  tS Let  |3  be  determined  so  that  (l-3)/m  =  1-a,  and  let 

the  sample  size  n  be  such  that  1  -y  =  pn.  Let  the  null  hypothesis  be  given 
by 


H0:F(mT)  >  l-(3; 

and  the  alternate  hypothesis  be  given  by 


H  :  F(mT)  <  1  -(3 


The  test  statistic  to  be  used  is  X#lV  the  first  order  statistic,  and  H.  will 

2/  l1'  0 

be  rejected  if  X^  >  ml,-' 

It  is  clear  that  Pr(Reject  Hq  |  HQ  is  true)  <  1-y,  since 


__ 

-'Note  that  this  means  testing  can  be  truncated  once  each  sample  system 
has  operated  for  mT  units. 
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Pr(X^  >  mT  |  HQ  is  true)  =  [  1  -  F(mT)  ]  n  <  pn 

=  i  -y  - 

Thus,  if  Hq  is  rejected,  it  is  concluded  that  F(mT)  <  1-J3,  and  the 

significance  level  of  the  test  does  not  exceed  1-y.  By  virtue  of  the 
initial  assumption  that  F(mT)  >  mF(T),  rejection  of  Hq  implies  that 

F(T)  <  (l -P)/m  »  l-o.  The  probability  that  is  accepted  erroneously 

does  not  exceed  1-y;  thus,  if  HQ  is  rejected  the  conclusion  is  r(a,y). 

The  usefulness  of  the  MDF  approach  as  a  design  tool  when  the 
appropriate  conditions  and  assumptions  are  satisfied  is  evident  from 
Proposition  1; 

Proposition  1.  Suppose  for  a  given  reliability  estimation  prob¬ 
lem  that  the  Conditions  1-4  above  are  met.  Suppose  further  that  m«M 
(i.e.  ,  F(mT)  >  mF(T)  ).  If 


and  if  the  null  hypothesis  Hq  of  the  MDF  hypothesis  test  is  rejected 
(i.e.,  «X(1)  >  mT),  then  r(o,y). 

Proof:  This  follows  immediately  from  the  relations  ( 1  -p)/m  = 

1  -a;  and  (3n  =  1  -y. 

Making  use  of  this  result,  it  is  possible  to  construct  tables  which  give 
sample  sizes  from  which  rejection  of  Hq  will  lead  to  conclusion  r(a,y)  for 
various  values  of  m  and  reliability-confidence  level  combinations  (a ,y). 
An  abbreviated  table  is  presented  below.  The  entries  are  the  minimum 
sample  sizes  necessary  for  rejection  of  Hq,  i.e.  ,  when  >  mT,  to 
lead  to  the  conclusion  r(a,y)  under  the  assumption  that  F(mT)  >  mF(T) 
for  the  value  of  m  shown. 
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TABLE  2.  1 


»  r  / 

m 

(.99.  .90) 

NHB9KEM 

1 

230 

45\ 

22 

2 

114 

22 

11 

2.5 

91 

18 

9 

3 

76 

15 

7 

3.75 

61 

12 

5 

4 

57 

11 

5 

5 

45 

8 

4 

6 

38 

7 

3 

Example.  If  the  mission  is  T  =  50  hours,  and  if  it  can  be 
assumed  that  F(4T)  >  4F(t),  then  to  be  able  to  conclude  r{„95,  ,90)  a 
minimum  of  11  systems  would  have  to  operate  successfully  for  at  least 
4- T  =  200  hours.  (See  Footnote  2.) 


The  MDF -hypothesis  test  can  also  be  used  to  provide  descriptive 
reliability  statements,  i.  e.  ,  statements  of  the  form  r(a,y)  based  on  the 
actually  observed  value  of  Suppose,  for  example,  that  for  a 

sample  of  n  systems  we  observe  =  m*T,  and  it  can  be  assumed  that 

m’l'sM.  It  is  then  possible  to  determine  the  strongest  statement  r(a,y), 
which  can  be  made  on  the  basis  of  the  test,  for  a  fixed  upper  bound  1-y 
on  the  significance  level  by  solving  the  equation 


-  log(i-v) 

log(l  -m(  1 -a)  ) 


for  o  .  It  is_also  possible  to  determine  the  highest  confidence  which  can 
be  associated  with  a  given  reliability  level  a  on  the  basis  of  the  test. 


Example .  Suppose  T  =  50  hours  and  a  sample  of  17  systems 
yields  a  value  of  =  178  hours.  Further,  suppose  it  is  possible  to 

assume  that  178/50  <  M.  Then  the  strongest  statement  of  the  form 
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r(a,  .  90)  which  can  be  concluded  on  the  basis  of  the  MDF-hypothesis  is 
r(.  9645,  .  90) ,  obtained  by  solving  the  equation  ' 


17  = 


log(.  10) 

logfl  -3.  56(1 -a)  ) 


for  a  . 


Example.  On  the  basis  of  the  performance  described  in  the 
previous  example,  the  statement  r(.9S,  .964)  could  also  be  concluded. 

3.  APPLICABILITY  OF  THE  MDF  APPROACH.  Whether  the 
MDF-approach  can  be  applied  to  a  particular  problem  depends  on  the 
extent  to  which  the  experimenter  can  justify  the  necessary  assumptions 
regarding  the  problem  and  the  distribution  function  involved.  The  purpose 
of  this  section  is  to  discuss  certain  fairly  nonrestrictive  conditions  which 
define  classes  of  distributions  to  which  the  MDF-approach  is  applicable. 

It  will  also  be  shown  that  the  MDF-approach  is  a  highly  accurate  approxi¬ 
mate  procedure  for  reliability  evaluation  when  the  distribution  involved 
is  exponential,  and  thus  does  not  satisfy  Condition  4. 

Proposition  2  establishes  that  the  MDF-approach  is  applicable  to  a 
fairly  commonly  occurring  class  of  distributions. 

I  Proposition  2.  Let  X  be  a  continuous  random  variable  with 

*  { x  |  x^>  0  } ,  and  let  Te  If  the  density  function  f(x)  is  mono¬ 

tone  nondecreasing  on  [0,mT]  ,  where  m>  1,  then  F(mT)  >  mF(T). 

Proof:  Letx*[T,mT]  .  By  hypothe sis ,  f(x)  is  monotone  non¬ 
decreasing  on  [0,  x]  ,  so-that  x*  f(x)  >  Ffx).  Thus  f(x)/F(x)  >  l/x.  Since 
this  is  true  for  every  x«[T,mT]  ,  it  follows  that  " 


This  is  equivalent  to  saying  that 

,  F(mT)  _  ,  ,mT. 

log  F(f)  >  loS  (“r")  =  log  m 

or  that  F(mT)  >  mF(T).  Q.E.D. 
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From  Proposition  2,  it  is  immediate  that  if  £{x)  is  monotone  non¬ 
decreasing  on  [0,4  ]  .  where  F(4  )  =  p,  then  the  MDF  technique  is 
P  P 

frit  all  valnoo  rtf  rr\  iinrVi  tV>at  rv-s  ("l  £  /T  1 

-  - -  *  -  -  “  ^'^p/  ‘  j  * 

It  is  appropriate  here  to  point  out  that  the  hypothesis 


Hfl:  F[mT)  >  1-J3 

is  logically  equivalent  to  the  statement 

mT  >  • 


Hence,  could  be  written  in  the  more  illuminating,  if  redundant,  form 

HqS  F(mT)  >  l  -P  and  mT  >  4  ^  ^  , 

Therefore,  to  reject  is  to  conclude  that 


F(mT)  <  1-p 

and 


Thus,  for  example,  if  one  can  assume  that  f(x)  is  monotone  nondecreasing 
on  [0,4p]  ,  and  if  1-p  <  p,  then  acceptance  of  Hj  implies  the  simulta¬ 

neous  validity  of  the  relations  F(mT)  >  raF(T)  and  1-p  >  F(mT).  ThuSy 
acceptance  of  implies  that  ”  ~  i 


F(T)  < 


1  -  P 

m 


so  that 


is  concluded. 


Example.  Suppose  it  can  be  assumed  that  the  density  function 
involved  is  monotone  nondecreaaing  on  [0,£  ]  .  Two  problems  are 

considered;  1)  Design  a  test  which  will  determine  whether  the  conclusion 
r(.95,.90)  is  valid  onthe  basis  of  a  sample  of  size  21;  2)  given  a  sample 


of  size  9(  and  X^  >4.  3T,  what  is  the  strongest  statement  of  the  form 
r(a,  .  90)  that  can  be  concluded? 


Solution  to  Problem  1;  The  MDF “hypothesis  test  here  can  be 
expressed  as  follows:  We  are  given  n  ■  21 ,  1  -y  =  .  10,  a  =  .  95.  Thus 
(l-P)/m  ■  .  05,  and  p^*  *  1-y  *  .10.  Thus,  p  =  .  896,  so  that  .  104/.  05  ■« 
m  b  2.  08.  Hence,  the  hypotheses  are 

Hq:F(Z.08T)  >  .  104 

and 


H,:  F(2.  08T)  <  .  104. 

i 

Hq  is  rejected  if  X^  >  2.  08T.  If  X^  >  2.  08T,  then  it  is  accepted  that  ! 

F(2.  08T)  <  .104and2.08T<  |_104<  ^5Q(  so  that  .  104  >  F{2.  08T)  >  2.  08  F(T),  j 

with  at  least  y-confidence;  i.e.,  if  X^j  >  2.  08T ,  r(.95,.90)  is  concluded. 

Hence, 

Hence,  suppose 

HQ:F(4.  3T)  >  .2355 
and 

H1:F(4.  3T)  <  .  2355. 

Then,  X^  >  4.  3T  results  in  the  conclusion  r(.  9476, .  90). 

We  now  compare  the  results  of  applying  the  MDF -technique  to  a 
situation  in  which  the  random  variable  X  involved  actually  has  the 
exponential  distribution  with  f(x)  ■  Ae"AX,  A  >  0,  and  thus  does  not 
satisfy  Condition  4  for  any  value  of  m  >  1.  Table  3.  1  provides  compari¬ 
sons  of  MDF-approach  results  with  lower  .90  confidence  bounds 
(l-pn  a  ,  90)  for  the  reliability  obtained  under  the  assumption  that  X 
actually  has  the  exponential  distribution  and  X^^  >  mT  for  the  sample 


i 


Solution  to  Problem  2: 


Here  n  «  9,  m  »  4.  3  and  1-y  *  ,  10, 
.10,  so  that  P  ■  .  7745,  and  a  »  ,  0524. 
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sizes  and  values  of  m  shown.  The  hypothesis  test  involved  in  obtaining 
the  bounds  under  the  exponential  assumption  is  nearly  identical  to  the  MDF 
hypothesis  test,  with  the  exception  that  the  hiDF  aeautMpiion  F(mT)  _> 
mF(T)  is  omitted  and  the  bound  for  F(T)  is  obtained  from  the  fact  that 
when  X  has  the  exponential  distribution 

F{mT)  <  1-p  =>  F(T)  <  l-^1//m  . 

The  validity  of  this  implication  is  seen  as  follows: 

F(mT)  *  l-e"AmT  <  1-p  e‘AmT  >  p  <*> 

e'XT  >  p1//m  <=>  l-e‘AT  <  l-p1/”1 

<=>  F(T)  <  1  -p1/"1  . 


TABLE  3.  1 


MDF- 

Conclusion 

r(.  99, . 

90) 

r(.  95, .  90) 

r(.  90, ,  90) 

m 

n 

rsm 

n 

fS£M 

n 

IH 

1 

230 

.990 

45 

.950 

22 

.900 

2 

114 

.990 

22 

.949 

11 

.895 

2.  5 

91 

.990 

18 

.948 

9 

.892 

3 

76 

.989 

15 

.947 

1 

.888 

3.75 

61 

.989 

12 

.946 

n 

.881 

4 

5? 

.988 

11 

.  946 

.881 

5 

45 

.988 

8 

.944 

IS 

.871 

6 

38 

.987 

■ 

.943 

3 

.858 

For  example,  the  MDF -approach  conclusion  based  on  a  sample  of  76 
systems  and  X^  £  3T  is  r(.  99, .  90).  The  corresponding  conclusion 

based  on  the  assumption  that  X  has  an  exponential  distribution  is 
r(.  989, .  90). 
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It  is  possible  to  analytically  explain  the  lack  of  sensitivity  of  the 
MDF  approach  to  certain  types  of  departures  from  Condition  4.  In 
particular,  the  condition  F(mT)  >  mF(T)  for  every  m*[l,  0]  is 
equivalent  to  saying  that  for  xc[T,6T]  the  distribution  function  F(x) 
Hnmlnatoa  fimrHnn 


(See  Figure  1.) 


FIGURE  1 


For  x<  f  0,  T/f(T)]  ,  L>(x)  can  be  thought  of  as  the  distribution  function  of 
a  random  variable  which  is  uniformly  distributed  on  [0(T/F(T)]  .  Thus, 
if  in  reality  X  has  a  distribution  function  with  the  property  that  F(mT)  < 
mF(T)  for  m  >  1,  then  F(x)  will  be  dominated  by  L(x),  the  slope  of 

which  shall  be  "small"  when  T/F(T)  is  large. 

What  happens  whenX  has  the  exponential  distribution,  with  F(x)  = 

1  -  e"**,  is  this  (see  Figure  2):  If  Ax  is  small,  i.e.  ,  if  l/A  is  large 
relative  to  x,  then  1  -  e“  x  =  Ax.  That  is,  F(x)  is  closely  approximated 
by  the  distribution  function  of  a  random  variable  which  is  uniformly 
distributed  on  (0,l/Aj  .  Since  A  =  F(x)/x  for  small  values  of  Ax,  if  AT 
is  small  F(T)/T  =  A,  so  that  F(x)  =  xF(T)/T  =  L<(x),  which  accounts  for 
the  relatively  small  error  in  the  MDF  conclusions  for  small  values  of  m. 
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4.  A  CONSIDERATION  OF  THE  WEIBULL  DISTRIBUTION.  The 
Weibull  distribution  occupies  an  important  position  in  the  theory  of 
reliability.  Thus,  it  is  useful  to  compare  MDF  results  with  those 
obtained  unde r  the  assumption  that  the  random  variable  under  considera¬ 
tion  has  the  Weibull  distribution.  For  these  comparisons,  it  is  assumed 

6 -1  -Ax® 


that  the  density  function  of  X  is  given  by  f(x)  a  Ax  e 


whe  re  x  >  0 , 


0  >  1  and  A  >  0.  The  distribution  function  of  X  is  F(x)  =  1  -  e 
Statements  of  the  form  r(a,y)  can  be  obtained  for  this  case  using  the  MDF 
hypothesis  test  structure  with  the  implication  F(mT)  <  1  -  (3(m) 

=>  F(T)  <  *  P(m)  replaced  by  the  implication 


F(mT)  <  1  -  p(m)  a>  F(T)  <  1  -  0(m)]  m .  Note  that  this  is  the  same 
substitution  which  was  made  in  Section  3  when  statements  r(a,y)  were 
obtained  for  the  exponential  distribution  using  the  MDF  hypothesis  test  ^ 

structure.  That  the  implication  F(mT)  <  1  -  p(m)  =>  F(T)  <  1  -[p(m)]^ 
is  valid  when  X  has  the  Weibull  distribution  with  0  >  1  is  seen  as  follows: 
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F(mT)  <  1  -  p(m)  <=>  1  -  e'^mT^  <1  -  p{m) 

-XT6me  .  -XT0  ...  ..Tnl 

<=>  e  >P(m)  <=>  e  >  [0(m)1 

1 

<*>  1  -  e“XTe  <  1  -  [P  ftn)]  m&  <=>  F(T) 

“V 

<  1  -  CP(m)]m 


—  r  * 

But  1  -  [p(m)]n‘  <  1  -  £p(m)]“*  since  £ 0, 1]  ,  m  <  1  and  0  >  1  . 

Thus,  if  X^j  >  mT,  where  Pr(X^  >  aT  |  is  true)  <  1  -  y,  the 

statement  r([  P  (m)]  m,y)  may  be  concluded. 


Example .  In  [2]  ,  Ldeberman  and  Johns  have  presented  a 
method  for  estimating  reliability  when  the  random  variable  involved 
has  a  Weibull  distribution.  Section  6  of  [2]  presents  an  illustrative 
example  in  which  the  reliability  of  a  system  for  a  mission  of  T  e  40 
hours  is  estimated,  with  y  s  90  confidence,  on  the  basis  of  the  following 
observations  on  the  first  five  order  statistics:  X^j  a  50,  X^  a  75, 


X^  a  125,  X^j  =  250  and  -  300*  The  sample  size  used  is  10. 

Using  the  estimation  method  they  derived,  the  authors  conclude  r(.796,  .90). 
Had  the  authors  simply  used  binomial  reliability  tables  [l]  ,  they  would 
have  concluded  r(.794,  ,90),  since  no  mission  failures  occurred  in  10 
trials.  By  way  of  comparison,  if  one  employed  the  MDF  technique  under 
the  assumption  that  f(x)  is  monotone  increasing  on  [0,  {  )  for  any  p  <  .  20, 


the  conclusion  would  be  r(.835,  .90),  while  if  one  were  to  utilize  the  MDF 
type  technique  adapted,  as  described,  to  the  assumption  that  X  has  the 
Weibull  distribution  with  0  >  1,  the  conclusion  would  be  r(.  832, .  90).  It 
should  be  noted  that  the  estimating  procedure  of  Lieberman  and  Johns 
does  not  involve  any  assumptions  on  the  values  of  0 ,  which  at  least 
partially  explains  the  relatively  small  difference  between  their  estimate 
and  the  binomial  estimate.  The  MDF  technique  becomes  increasingly 
less  accurate  as  an  approximate  method  as  0  approaches  0.  Therefore, 
caution  should  be  used  in  applying  the  technique  to  a  Weibull  situation  if 
it  is  suspected  that  0  is  actually  less  than  1. 


REFERENCES 


1.  Confidence  Limits  for  Attribute  Data,  second  revision, 
Liuciuietsd  Jv£i»e»ue  and  Space  Company,  Sunnyvale,  California,  1966, 
pp.  1-87. 

2.  M.  V.  Johns,  Jr.,  and  G.  J.  Lieberman,  "An  Exact 
Asymptotically  Efficient  Confidence  Bound  for  Reliability  in  the  Case 
of  Weibull  Distribution",  Technometric  a ,  v.  8,  1966,  pp.  135-175. 


« 


349 


RELIABILITY  IN  COMPLEX  SYSTEMS* 


A.  Clifford  Cohen 
The  University  of  Georgia 

1.  INTRODUCTION.  It  is  well  established  that  the  reliability  of 
complex  systems  varies  with  time.  Following  a  break-in  and  adjustment 
period  during  which  minor  deficiencies  are  corrected,  a  system  is  placed 
in  service  with  an  initial  reliability  R^.  Thereafter  the  reliability  either 

increases  as  further  system  deficiencies  are  corrected  or  it  decreases 
as  components  deteriorate  with  age.  In  the  life  of  some  systems,  there 
is  an  early  period  during  which  reliability  increases,  and  a  subsequent 
period  of  constant  or  decreasing  reliability.  Our  attention  in  this  paper 
Is  limited  to  models  of  monotone  increasing  and  monotone  decreasing 
reliability. 

2.  EXPONENTIAL  MODELS. 

Increasing  Reliability.  With  R(t)  designating  reliability  at  time  t, 
a  simple  exponential  model  for  increasing  reliability  may  be  expressed 
as 

(1)  R(t)  =  1  -  (1  -  RQ)e~  a  *,  a  >  0,  t  >  0  , 

where  Rq  and  a  are  parameters  to  be  estimated  from  sample  data. 

Decreasing  Reliability.  When  reliability  decreases  with  time,  we 
consider  the  following  relationship 

(2)  R(t)  a  RQe‘  °  t,  a  >0,  t  >  0  , 

where  again  R  and  a  are  parameters  to  be  estimated  from  sample 
data. 

3.  MAXIMUM  LIKELIHOOD  ESTIMATION.  Let  n.  specimens  be 
tested  at  time  t^  and  let  designate  the  number  of  successes  achieved 

(i  =  0,  1,  . ,  .  k).  Sample  data  resulting  from  a  sequence  of  such  tests 

then  consist  of  the  triples  (t  ,  n  ,  x  ) ,  ft. ,  n, ,  x.) ,  . .  .  (t.  ,  n.  ,  x. ). 

ooo  lll  kkk 

From  these  data,  we  must  determine  which  model  is  appropriate  (i.  e. 

*  Reprinted  with  permission  of  the  American  Society  for  Quality  Control 
from  Transactions  of  the  Twentieth  Annual  A.  S.Q.C.  Technical  Confer¬ 
ence,  June  1966. 


increasing  or  decreasing)  and  then  estimate  the  parameters.  Before 
examining  the  problem  of  choosing  between  w*  will  first  cor.cid 

estimation  in  each  model  separately. 

With  the  reliability  R^)  at  time  t^  abbreviated  to  R^,  the  likeli¬ 
hood  function  for  our  sample  may  be  expressed  as 

(3)  *o*'  *  *  11  “  *^'*1  . 

i*0 

Estimation  in  the  Increasing  Model.  When  R  is  given  by  equation 
(l),  we  make  this  substitution  In  (3)  to  obtain 

(4)  L  (R  ,  a)  =  II  ("i)[l  -  (1  -  R  )e“a  fci]  Xi[  (l  -  R  )e"a  h]  VXi. 

0  i*0  Xi  °  ° 


where  the  subscript  indicates  employment  of  the  increasing  model. 

On  taking  logarithms  of  (4)  differentiating  with  respect  to  R  and  a 
in  turn,  we  obtain 


»,  .  ^  (n.  -  x.)  ,  -o  tl 

SlnLj  o  *  i  i  k  x^e  1 

9R~"  *  "  TTr  +  ?  1  -  (1  -  R  )e“a  *i 
o  o  0  '  o 


"a 

j.  k  k  iXie 

-  =- 1  *,(»,-*,)  + (i  •  R0)  =  -- 

0  0  1  -  (1  -  RQ)e 


-ati  ‘ 


On  setting  these  equations  equal  to  zero  and  simplifying,  estimat¬ 
ing  equations  in  the  case  of  increasing  reliability  become 


r- 


k  x  e  a  k 

p  )  s  - i - -  =  r  (n.  -  X.)  , 

°  0  1  -  fl  -  R  )e"ati  0  1  1 

*  o 


k  t  x  e  *  k 

(1  -  R  )  r  - — - —  =  2  t  (n  -  x.)  . 

°  0  1  -(1  -R  )e‘ati  0  1  1  1 

'  o 


When  (1)  is  the  appropriate  model,  the  required  estimates  R  and 

o 

a  can  he  found  by  simultaneously  solving  (6)  using  standard  iterative 
techniques.  Should  the  value  a  thereby  obtained  from  some  given  sample 
turn  out  to  be  negative,  this  suggests  that  the  increasing  model  is  inappro¬ 
priate  and  that  we  should  either  set  Q  =0  or  investigate  the  decreasing 
model  of  equation  (2). 

Estimation  in  the  Decreasing  Model.  When  R,  is  given  by  equation 
(2),  we  make  this  substitution  in  [3j  and  thereby  obtain 

(7)  LD(Ro,a)  =  i  (x|)  (Roe'a  Vitl  -  V’°  ti)ni‘Xi' 

i=0 

where  the  subscript  ^  indicates  employment  of  the  decreasing  model. 

On  taking  logarithms  and  differentiating,  we  have 


Ex  .  .  -  x  t, 

.  i  kin.  -  x.)e  1 
0  •  i  i' 

— 

O 


)lnL_  k  k  t,{n.  -  x.)e  U  *i 

—5.  r  .  z  x  t  +  R  2  -ili - i- - 

8Q  0  11  °  0  1-R  e-ati 

o 


On  equating  the  above  partials  to  zero,  the  estimating  equations  become 
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When  (2)  is  the  appropriate  model,  the  required  estimates  and 

a  are  found  by  simultaneously  solving  the  two  equations  of  (9).  In  this 
case,  should  the  value  $  thereby  obtained,  turn  out  to  be  negative  (an 
unacceptable  result)  this  suggests  that  either  we  should  set  ft  =  0  or 
that  the  increasing  model  of  (l)  should  be  employed. 

4.  CHOOSING  THE  MODEL.  In  many  applications,  a'  priori  con¬ 
siderations  dictate  which  of  the  models  considered  here  is  appropriate. 

In  others,  the  sample  data  will  clearly  indicate  which  model  Is  to  be 
preferred.  In  perhaps  the  majority  of  applications,  the  choice  of  the 
model  will  involve  a  more  careful  analysis  of  sample  data,  and  the  follow¬ 
ing  procedure  is  suggested  for  choosing  between  the  increasing  reliability 
model  of  (l)  and  the  decreasing  reliability  model  of  (2). 

1.  Solve  equations  (6)  for  tentative  estimates  of  Rq  and  a 

in  the  increasing  reliability  model.  If  the  tentative 
estimate  of  a  thus  obtained  is  positive,  accept  both 
tentative  estimates  and  designate  them  as  and  a,,. 

If  the  estimate  of  a  obtained  from  (6)  is  negative  then 
accept  as  estimates  =  0  and  Rqj  = 
k  k  ° 

£  x./n  where  n  =  £  n,  . 
i=0  1  i=0  1 


Solve  equations  (9)  for  tentative  estimates  of  Rq  and  a 
in  the  decreasing  reliability  model.  If  the  tentative 


estimate  of  a  from  these  equations  is  positive,  then 

accept  both  tentative  estimates  from  (9)  and  designate 

them  as  it  _  and  cL.  If  the  estimate  of  a  obtained 
oD  D 

from  (9)  is  negative,  accept  as  estimates  a  =  0  and 
*  k 

RoD  '  ,E_  */”• 
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3.  Calculate  Lj  =  JL^R^,  Sj)  and  LD  =  Ld^qd’  ®d)  u“ing 
equations  (4)  and  (7)  respectively. 

A  A 

4.  If  L  >Xi_,  choose  the  increasing  reliability  model;  if 

a  D 

L_  >  L.,  choose  the  decreasing  reliability  model.  Other- 
wise  (if  L_  *  L_),  we  employ  the  constant  reliability  model 

a  k 

R  ,  =  R  with  a  =  0,  and  with  R  ■  £  x./n  . 

1  0  0  1*1  * 

5.  ASYMPTOTIC  VARIANCES  AND  COVARIANCES  OF  ESTIMATES. 
The  asymptotic  variance -covariance  matrix  of  the  maximum  likelihood 
estimates  ft^  and  a  is  given  as 


9R0 

-E(82‘nL  ) 
v  8R  8a  1 
o 

V(ftQ)  Cov(fto,  a  ) 

(10) 

l8a8R  ’ 

.  o 

v  v  > 

da  _ 

S 

Cov(fto,a  )  V(ft) 

_  — 

where  E  symbolizes  expected  values.  In  practice,  satisfactory  approxi¬ 
mations  can  be  obtained  by  replacing  expected  values  of  the  partials  with 

their  actual  values  calculated  using  R  =  ft  and  a  s  £  .  The  required 

oo 

second  partials  follow  from  further  differentiation  of  (5)  and  (8)  in  turn. 
These  results  are  given  below. 


For  Increasing  Reliability. 


02lnL. 


8R! 


£  K  -  xi> 

-  - 5“ 


-2  at, 
x^e  i 


0  (1-R0)‘ 


0  [1  -(1  -  R0)e‘ati]2 


(ID 


02lnLj 

8R  8a 
o 


S^nLj 

8a  8  R 


k 

=  -  Z 


.  -  o  ti 

xiV»  1 


[1  -(l-R>-ati]2 


82lnL  k  t2x.e"  a  ** 

- yi-  =  -  (1  -  R  )  Z  - Li - — 

\  8o2  0  0  [1  -  (1  -  R  )e"ati]  2 


% 


For  Decreasing  Reliability. 


(12) 


/\2,_T 
/  "  *“~D 

U 

7 

-  S  x  /R  - 

o  1  0 

i.  /_  at, 

-  v-  -  -j/- 

(  9R2 

0  (1  .  Roe-°  *i)2 

) 

.  S2lnLD 

k  (i, 

8R  fta 

0 

9a  9R 

o 

0  (l-Roe‘ati)2 

82lnLD 

k  t2(n 

_  n  ^  A  * 

V 


9a 


0  (l-R.-aV 


Although  asymptotic  variances  and  covariances  might  be  misleading 
for  small  samples,  they  should  closely  approximate  the  true  variances 

k 


and  covariances  for  moderate  size  samples;  i.  e.  for  n  =  2  n^  in  excess 


of  say  20. 


1 


The  variance  of  $1.  which  is  of  course  a  function  of  ft  and  ^  , 
i  o 

can  be  approximated  by  employing  a  theorem  of  Cramer  [l]  which  enables 

us  to  write 


(13)  V(&  )  =  (^-)2  V(Ro)  ♦  24L)(4-)Cov(£0,S)  ♦  VJS)  . 

o  o 


that 


For  the  increasing  reliability  model,  it  follows  from  equation  (1) 


9R 


-at. 


8Ri 


=  e  "i  and  1  =  t  (l  -  R  )e  a  *1 

9a  iv  o' 


Accordingly,  in  this  case,  we  have 


(14)  VC^)  «  e'2lltHv(^o)  +  2(l -fc^Cov^,  $)  +  (l  ••  fc/t^Vfi)]  . 
For  the  decreasing  reliability  model,  it  follows  from  equation  (2) 

that 
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In  this  case 


A 

SR 


=  e 


-a  t 


i,  and 


A 

da 


=  -t  R  e 
i  o 


-a  ti 


(15) 


V(R^)  becomes 

V(ftt)  *  e'2fi  \v$o)  -  2iotlCov(fto  S)  +  k2t2V(&)] 


6,  ILLUSTRATIVE  EXAMPLES,  In  order  to  illustrate  the 
practical  application  of  results  of  this  investigation,  let  us  consider 
simulated  test  data  on  two  complex  systems,  one  with  increasing  reli¬ 
ability  and  the  other  with  decreasing  reliability, 

Example  1,  Increasing  Reliability.  Following  are  results  of  the 
initial  and  four  subsequent  tests  conducted  on  this  system. 


*i 

(time  periods)  0 

1 

2 

3 

4 

ni 

(number  tested)  20 

10 

5 

5 

5 

xi 

(number  successes)  13 

8 

5 

4 

5 

x^/n^  (success  ratio)  0.65 

0.80 

1.00  0.80 

1.  00 

4  4  4 

Summarizing,  we  have  n  *  S  n.  =  45,  £  x.  =  35,  £  t,x,  ■  50, 

0  1  0  1  o  1  1 

4  4 

£  (n,  ’  *,)  B  10>  and  S  t  (n  -  x.)  =  5,  Our  problem  now  is  to  substitute 
0  i  i  0  1  1  1 

these  values  into  (6)  and  solve  for  the  required  estimates  ft  and  a,  . 
Any  standard  iterative  method  might  be  employed  for  this  purpose,  but 
the  following  procedure  seems  relatively  straightforward  and  should  be 
generally  satisfactory. 

(o) 

As  an  initial  approximation,  Rv  ,  we  select  the  initial  success 
ratio  x0/nQ  =  0.  65,  and  as  an  initiaP  approximation  to  R^,  the  success 

ratio  x./n.  ■  0.80.  We  substitute  these  two  values  in  (1)  with  t.  ■  1  to 
obtain  1 

0.  80  =  1  -  (l  -  0.  65)e  a  ,  and  it  follows  that 


-a 

e  a 


1  -  0,80 
0.  35 


0. 57143. 


357 


Reading  from  a  table  of  exponential  functions,  we  have  as  an  initial 
approximation,  a\°)  =  0.  56.  The  superscripts  serve  tp  indicate  the 

i  f  ii  *  _*♦  sir  i  it  i  i  <  i  •  /t  n  i 

u  1'UC  1  ox  cite  appiuAUiiduuuo.  tre  auuocjuciiMy  tiy  ickkiu^  u  -  u.  anu 

0.  55  respectively  in  the  two  equations  of  (6)  and  solve  these  in  turn  for 
Rq.  We  of  course  are  seeking  a  value  of  a  such  that  the  two  values  of 

Rq  thus  obtained  are  identical.  Following  is  a  summary  of  these  results 

including  interpolation  to  obtain  new  approximations  and  R^  . 


a 

R  from  1st.  Eq. 

°  of  (6) 

R  from  2nd  Eq, 

°  of  (6) 

Difference 

0.  500 

0.  662 

0.  685 

-0.023 

0.  543 

0.  656 

0.  656 

0 

0.  550 

0.  655 

0.  651 

+0.004 

As  new  approximations ,  we  have 


543  and  R 


(1) 


0.  656  . 


We  now  elect  to  seek  further  improvement  through  Newton's  method 
which  is  based  on  Taylor  aeries  expansions  of  the  estimating  equations 
about  a  point  in  the  vicinity  of  their  simultaneous  solution.  Let  h  and 
k  designate  corrections  to  be  determined  by  the  iteration  process  so  that 

=  R^  +  h  and  a  =  a  ^  +  k.  Using  Taylor's  theorem  and  neglecting 

terms  containing  powers  of  h  and  k  above  the  first,  we  have  as  correc¬ 
tion  equations 

aZlnL.  92lnLT  ftlnL. 

,  _ ,  _ I  _ 

"  2  9R  8a  9R 

9R  o  o 

o 

92lnLi  32lnLj  9lnL,j 

h  9R  9a  +  k  “  =  "  ~da  ' 

o  da 


? 


1 


which  are  to  be  solved  simultaneously  for  h  and  k. 


Using  (5)  and  (ll)  we  evaluate  the  partials  in  these  equations  at  the 

point  R  =  0.  656,  a  =  0.  543,  and  the  correction  equations  become 
o 


-119. 9088  h  -  16.7561  k  =  0.0998, 
-  16.7561  h  -  11.6602  k  =  0.0038. 


) 

i 
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Solving,  we  have  h  =  -0,00098  and  k  =  0.00109.  Thus  the  final  esti¬ 
mates  become 

R  =  0.  656  -  0.00098  =  0.6550, 
o 

S  =  0.  543  +  0,  00109  =  0.  5441  . 

As  verification  of  the  accuracy  of  these  final  estimates,  they  were 
substituted  into  the  first  partial s  of  (5)  with  results  as  follows; 


=  0.  001, 


=  0.001. 


R  =0.6550 
o 

o  =0.5441 


R  =0.  6550 
o 

a  =0.  5441 


Values  of  zero  would  have  indicated  perfect  agreement.  The  small  values 
obtained  here  are  considered  satisfactory  and  no  further  iterations  are 
deemed  necessary. 


Rather  than  employing  the  intermediate  interpolative  procedure,  we 
might  have  moved  directly  from  the  initial  approximations  to  the  Newton 
method.  In  that  case,  of  course,  one  or  more  additional  cycles  of  the 
Newton  iteration  might  have  been  required  to  reach  the  same  final 
results  as  those  obtained  here. 


Using  values  of  the  second  partials  employed  as  coefficients  in  the 
correction  equations,  the  variance-covariance  matrix  of  (10)  is  approxi¬ 
mated  as 

Pi  19. 9088  16.7561  1  f  0.  0104  -0.0150  1 


L  6. 7561 


16.7561  0.0104  -0.0150 

11.  6602  J  -0.0150  0.1073  . 


Accordingly  we  have 


V(ft  )  =  0.0104,  V(o)  =  0.1073,  Cov(R  ,a  )  =  -0.  01  50. 


Using  these  values  in  equation  ( 14)  we  calculate  V(R^)  at  times 

tj  bO,  1,  2,  3  and  4.  We  also  calculate  the  predicted  values  of  R^ 

(i.  e.  Rj)  at  these  times  using  equation  (1)  with  o  =  ft  =  0.  5441  and 

R  s  R  b  0.6550,  These  results  are  displayed  below  along  with  actual 
success  ratios  for  comparison. 
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0 

1 

2 

0 

4 

V"i 

0.  65 

0.  80 

1.00 

0.80 

1.00 

K 

0. 6550 

0. 7998 

0.8838 

0.9326 

0.9609 

0.0104 

0.0043 

0.0046 

0. 0036 

0.0022 

An  attempt  to  fit  the  decreasing  model  of  (2)  to  these  data  resulted 
in  a  value  a  <  0  as  absolution  of  (9).  We  were  thus  led  to  estimates 

a_  =  0  and  6  _  =  2  x./n  =  0.  7778.  Using  these  estimates 

"  i=0  a1 

in  (7),  we  calculate  Lq  =  0.  0008, whereas  using  the  estimates 
a.j  =  0.  1472  and  R  ^  =  0  6074  in  equation  (4),  which  applies  when  the 

increasing  model  of  (1)  is  employed,  we  calculate  L.  =  0.  005.  Since 

A  a  * 

L,j  >  Lq,  our  choice  of  the  increasing  reliability  model  of  (l)  in  this 
instance  is  verified  as  being  correct. 

Illustrative  Example  2.  Decreasing  Reliability.  Following  are  test 
data  on  a  system  in  which  reliability  is  decreasing  with  time. 


*i 

(Time  periods) 

0 

1 

2 

3 

4 

ni 

(Number  Tested) 

20 

5 

5 

5 

3 

xi 

(Number  Successes) 

12 

3 

2 

2 

1 

Vni 

(Success  Ratio) 

60 

.  60 

.40 

.40 

.  33 

Summarizing,  we  have  n  =  2^  n^  =  38, 


2  j  x.  =  20, 


2,  t.x.  =  17, 
1  i  i 


4  4 

2  i  (nf  -  x.)  =  18,  and  £  ^  t^(n  -  x^)  =  25.  Proceeding  to  solve  equations 

(9)  using  these  data,  we  again  select  as  an  initial  approximation  to  R  , 

the  initial  success  ratio.  Thus  we  have  r(°)  =  0.60.  The  initial  ° 

o 

approximation  to  a  comes  from  a  free-hand  curve  through  the  points 
on  a  plot  of  the  success  ratios  versus  time  as  a'°'  =  0.  12. 


This  time,  we  skip  the  intermediate  approximations  as  used  in  the 
first  illustration  and  proceed  immediately  to  the  Newton  method.  At  the 
end  of  one  cycle,  we  have  as  improved  approximations 
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Rq1)  *  0.605  and  a^sO.145, 

With  the  partial®  of  (8)  and  (11)  evaluated  for  R.  =  0.  605  and 
a  *  0. 145,  the  correction  equations  become 

-iZ2. 9348  h  +  48. 5840  k  =  .0. 1886, 

48. 5840  h  -  76. 0959  k  *  -0. 0496. 


On  solving,  we  find 

h  *  0.0024  and  k  =  0.0022, 

and  as  final  estimates  (or  new  approximations)  we  have 

R  =  0.6050  +  .0024  =  0.  6074 
o 

a  =  0. 1450  +  .0022  =  0. 1472. 


These  values  are  substituted  into  the  first  partials  of  (8)  with  the 
following  results 


81nLD 

dR 

o 


=  -0.0001, 

R  =0.6074 
o 

a  =0. 1472 


dlnL>_ 


8a 


=  -0.0012. 

R  =0. 6074 
o 

a  =0.1472 


These  values  are  considered  to  be  sufficiently  close  to  zero  to  justify 
acceptance  of  &  =  0.  6074  and  a  =  0.  1472  as  final  estimates,  and  no 
further  iterations  were  made. 

As  in  illustration  1,  the  variance -covariance  matrix  of  R  and  a 
is  approximated  uning  coefficients  of  the  correction  equations.  Thus  we 
have 


- 

«■ 

-1 

- 

122.9348 

-48. 5840 

0. 0109 

0.0069 

-48. 5840 

76.0959 

0. 0069 

0.0176 

Accordingly  for  this  example,  we  have  V(R  )  =  0.0109,  V(ei)  =  0.0176, 

and  Cov(R  ,a)  =  0.0069.  The  variance  of  R,  at  t.  =  0,  1,2,  3  and  4 
o  i  i 

is  computed  from  (15)  and  the  predicted  (estimatedjvalues  of  R^ 
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(designated  R^  for  these  same  time  values  are  computed  from  (2). 
These  results  along  with  the  success  ratios  are  displayed  below. 


*i 

0 

1 

2 

3 

4 

V"i 

0.  60 

0.  60 

0.40 

0.40 

0.  33 

* 

R. 

0.  6024 

0. 5246 

0.4525 

0.  3906 

0. 3371 

V(fy 

0.0109 

0,0069 

0. 0112 

0.  0183 

0.0250 

An  attempt  to  fit  the  increasing  model  of  (1)  to  the  data  for  this 
example  resulted  in  a  value  a  <  0  as  a  solution  of  (6)  and  we  were  thus 

led  to  &„  a  0  and  fl  .  =  E  x./n  -  0.  5263.  Using  these  estimates  in 

1  ^  ol  i30  x' 

(4),  we  calculate  =  0.001,  whereas  using  the  estimates  =  0.  1472 
and  =  0.  6072  in  (7),  we  calculate  =  0.  002.  Thus  with 
for  these  data,  the  decreasing  model  of  (2)  is  the  proper  choice. 

7.  SOME  CONCLUDING  REMARKS.  Although  questions  relating 
to  how  many  tests  should  be  conducted  and  when  they  should  be  scheduled, 
have  not  been  formally  examined  here,  they  are  not  to  be  dismissed  as 
being  unimportant.  When  tests  are  destructive  and  the  cost  is  great, 
there  is  considerable  pressure  to  limit  their  number.  Considerations 
having  little  to  do  with  statistics  or  probability  often  dictate  that  a  rather 
large  proportion  of  available  test  specimens  be  expended  in  the  initial 
tests.  Such  allocation,  of  course,  limits  the  number  available  for  subse¬ 
quent  testing.  Further  studies  in  this  area  to  determine  optimum  test 
designs  are  still  in  progress, 

When  this  investigation  was  begun,  it  was  intended  to  consider  not  only 
the  exponential  models,  but  also  the  hyperbolic  model 

R  -  R 

R(t)  =  R  +  —  ;  t  >  0,  where  0  <  R  <1, 

'  oo  a  t  +  1  —  —  o  — 

0  <  R  ®  <  1 ,  and  a  >  0.  As  in  the  exponential  models,  R^  is  the  initial 

probability  at  time  t  =  0.  R  «  is  the  final  or  ultimately  attainable 
reliability;  i.  e.  Lim  R(t)  =  Res,  In  this  model,  reliability  is  increasing 
t 

or  decreasing  with  time  depending  on  whether  R«  >  Rq  or  Res  <  Rq. 
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A  special  case  of  the  hyperbolic  model  with  a  =  1  and  Roe  >  R  has  been 
considered  by  Lloyd  and  Lipow  [2]  . 

Procedures  similar  to  those  employed  in  this  paper  can  be  used  to 
estimate  parameters  o  ,  RQ,  and  R«  in  the  general  case,  but  in  view 

of  the  length  that  the  present  paper  has  already  attained,  further  con¬ 
sideration  of  this  model  ie  being  temporarily  deferred. 
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ESTIMATION  OF  TIME  FUZE  CHARACTERISTICS 
BY  NON-LINEAR  REGRESSION  METHODS 

Weldon  F.  Willoughby 

U.  S.  Army  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland 

INTRODUCTION.  Ballistic  tests  of  mechanical  time  fuses  provided 
data  which  indicated  that  the  biases  in  functioning  time  (i.  e.  ,  the  differ¬ 
ences  in  the  running  time  and  the  set  time)  for  a  given  time  setting  were 
relatively  large  and  widely  dispersed  when  the  firing  a  were  conducted  at 
low  temperatures.  For  the  firings  conducted  at  higher  temperatures,  the 
biases  decreased  in  magnitude  and  became  more  uniform  as  the  tempera¬ 
ture  increased.  Since,  in  the  past,  the  bias  in  functioning  time  of 
mechanical  time  fuzes  assembled  to  artillery  projectiles  had  been  expressed 
implicitly  in  the  firing  tables  as  a  function  of  set  time  alone,  an  investiga¬ 
tion  was  conducted  to  determine  the  dependence  of  fuze  bias  on  tempera¬ 
ture  as  well  as  set  time.  In  addition,  it  was  desired  to  find  an  equation 
expressing  the  relationship  between  fuze  bias,  temperature  and  set  time 
which  could  be  programmed  for  use  on  the  Field  Artillery  Digital  Auto¬ 
matic  Computer  (FADAC). 

Plots  of  the  bias  in  fuze  functioning  time  versus  set  time  for  constant 
temperatures  indicated  that  the  two  variables  were  linearly  related.  On 
the  other  hand,  plots  of  fuze  bias  versus  temperature  for  constant  time 
settings  resembled  single  branches  of  rectangular  hyperbolas,  indicating 
a  nonlinear  relationship  between  bias  and  temperature. 

From  these  indications,  and  after  trying  several  models,  a  candidate 
model  equation  containing  two  linear  parameters  and  one  nonlinear 
parameter  was  assumed  to  adequately  describe  the  relationship  among 
fuze  bias,  the  dependent  variable,  and  temperature  and  Bet  time,  the  two 
independent  variables.  In  the  model,  it  was  assumed  that  only  the  biases 
were  affected  by  errors  of  measurement. 

As  is  well  known  (see  [2]  and  [8]  ,  etc. ),  the  method  of  least  squares, 
which  is  the  method  most  often  used  in  regression  problems,  may  be  used 
to  estimate  the  parameters  of  functional  relationships  among  sets  of 
experimental  data  whenever  it  can  be  assumed  that; 

(a)  the  dependent  variable,  Y,  is  related  to  known  levels  of  a  set 
of  independent  variables,  X^,  X^, . .  .  X^,  by  a  relationship  of  the  form 

Y=  PlXt  +  P2X2  +•  •  •  +  Pk  Xk  +  « 


(1) 


where  the  (i  :  1,  2 . k)  are  unknown  parameters  and  «  Is  the  error 

in  the  observed  value  of  the  dependent  variable,  and 

(b)  the  errors  in  the  observed  values  of  the  dependent  variable  are 
independent  and  randomly  distributed  with  zero  mean  and  a  common 
variance,  (in  addition,  if  valid  statistical  tests  of  significance  are  to 
be  made,  it  is  also  necessary  to  assume  that  the  errors  are  normally 
diet  vibuted. ) 

However,  when  the  functional  relationship  among  the  variables  cannot 
be  expressed  as  a  linear  combination  of  the  unknown  parameters  as  in  (l), 
the  usual  procedures  for  estimation  by  the  method  of  least  squares  are  not 
directly  applicable.  Several  procedures  are  available  (see  [3]  ,  [4]  ,  [6]  , 
and  [7]  )  for  estimating  the  parameters  of  nonlinear  functions.  These 
procedures  generally  employ  a  transformation  of  the  function  into  a 
linear  form  either  by  a  change  of  variables  or  by  an  approximation  based 
on  a  Taylor's  series  expansion  under  the  assumption  that  the  function  is 
locally  linear.  In  connection  with  the  latter,  the  approximating  procedures 
require  iterative  processes  to  converge  to  solutions  and  the  advent  of  high 
speed  computers  has  greatly  facilitated  the  solution  of  nonlinear  regression 
problems  by  these  methods. 

For  this  problem,  the  model  equation  was  assumed  to  be  of  the  form 


(2) 


y  -  Pl+P2X2j  a  +< 
ijk  X2j+P3  11 

PlXli  +  P2  Xli  X2j  + 
‘X2j+P3  X2j+P3  ijk 


where  the  observed  fuze  bias  at  time  setting  and  temperature 

X£j  and  is  a  random  error  with  zero  expectation.  Assuming  this  model 
equation,  the  regression  function  to  be  fitted  is 

h  +  1*2  x2i 

“  - E  ( V  ■  xii 


Pi  xli 
X2j  +  P3 


X2j  +  1*3 
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It  can  be  seen  by  inspection  of  the  first  form  of  (3)  that,  for  a  constant 

temperature  (X,.  is  constant),  the  regression  function  represents  a 
*-J 

straight  line  passing  through  the  origin  (zero  set  time  and  zero  bias)  and 
for  a  constant  time  setting  (X^.  is  constant),  the  regression  function 

represents  a  rectangular  hyperbola  with  vertical  and  horizontal  asymptotes. 

The  function  given  in  (3)  was  fitted  to  sets  of  data  obtained  from 
ballistic  functioning  tests  of  the  mechanical  time  fuze.  Least  squares 
estimates  of  the  three  parameters  were  determined  first  by  an  iterative 
process  (after  linearizing  the  function)  which  exploited  the  facility  and 
speed  of  computation  of  the  Ballistic  Research  Laboratories  Electronic 
Scientific  Computer  (BRLESC)  in  scanning  the  parameter  space.  Then, 
as  a  check  on  the  results  obtained  by  this  procedure,  least  squares 
estimates  were  also  obtained  by  the  Hartley  [5]  modification  of  the  Gauss - 
Newton  iteration  which  in  theory  has  the  highly  desirable  property  of 
guaranteed  convergence  to  estimates  yielding  the  absolute  minimum  sum 
of  squares  of  residuals  provided  the  initial  estimates  of  the  parameters 
are  in  the  neighborhood  of  the  final  values. 

In  order  to  obtain  approximate  confidence  intervals  about  the  individ¬ 
ual  parameters,  as  estimate  of  the  variance -covariance  matrix  of  the 
least  squares  estimates  was  obtained  using  the  Fisher  information  matrix 
described  by  Rao  [9]  .  The  confidence  intervals  were  constructed  by  the 
procedure  described  by  Stone  in  his  discussions  on  the  paper  by  Beale 
in  [1]  . 


THE  SCANNING  PROCESS.  To  determine  estimates  of  the  unknown 
parameters  by  the  scanning  process,  the  regression  function  was  linear¬ 
ized  by  substituting  an  initial  estimate  of  the  nonlinear  parameter  P^, 

The  two  linear  parameters,  pj  and  p^,  were  then  estimated  by  the  usual 
least  squares  procedure.  The  sum  of  squares  of  residuals  was  computed 
using  the  three  estimates  of  the  parameters.  In  the  next  iteration,  the 
initial  estimate  of  P3  was  changed  by  some  small  increment  and  new 
estimates  of  Pj  and  P2  were  determined  as  before.  Again,  the  sum  of 
squares  of  residuals,  using  the  new  estimates,  were  computed.  The 
process  was  repeated  until  a  minimum  sum  of  squares  of  residuals  over 
a  rather  large  range  of  estimates  of  33  was  obtained.  The  estimates  of 
the  parameters  which  gave  the  minimum  were  considered  to  be  the  least 
squares  estimates. 


If  Pj,  the  value  of  33  which  gives  the  minimum  sum  of  squares  of 
residuals,  is  substituted  in  (3),  the  denominator  of  each  of  the  terms 
could  be  considered  to  be  of  the  form 


(*> 


2j 


"  X2J  +  P: 
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Now,  let 


(5) 


Then  (  3)  can  be  written  as 

<‘>  *  «V  -  "l  *11  +  "2  *H  ‘*J 


which  is  linear  in  |3j  and  Least  squares  estimates  of  (3^  and  82, 

for  the  given  value  of  (3 3  ,  may  be  obtained  by  solution  of  the  normal 
equations  resulting  from  minimizing  the  sum  of  squares  of  residuals 


Q  <»  ■  *  <  V  -  <"1  *»  +  *11  *2i»' 


respect  to  p.  and  p_,  when  the  errors  in  the  Y  are  independent 

I  fc  ljk 


and  distributed  with  mean  zero  and  constant  variance 


On  the  other 


hand,  when  the  errors  in  the  Y^k  have  different  precision,  i.e.,  the 
variance  of  errors  in  Y  ^  is  not  constant,  the  sum  of  squares  of 
residuals  to  be  minimized  is  of  the  form 


(8)  Q  (0)  -  £  -y  <  Vijk  -  (Pl  «u  4  82  .lt  >tj)  )- 

where  the  are  relative  weights  of  the  Y^k  which  make  the  quantities 


V  /— ■  1 

Y...  ■  VcT  Y... 

ijk  ljk 


have  a  common  variance.  (In  the  case  of  homogeneous  variances,  the 
relative  weights,  w..  =  1.)  Thus,  a  predicted  value  of  Y  may  be  deter¬ 
mined  from  the  equation 
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Yij  =  Pi  *li  +  P2  hi  hj 


(10) 


P1  Xli  ,  P2  Xli  X2i 

+  VT 


The  standard  error  of  estimate  is  given  by  the  expression 


<r  = 


n  -  3  *  "ij  ^Yijk  “  ^1  *li  +  ^2  *11  *Zp  * 

1JK 


In  the  process  of  determining  the  prediction,  it  was  noted  in  the 
examination  of  the  data  that  the  dispersions  of  the  observed  biases  varied 
considerably  from  temperature  to  temperature  and  to  some  degree  from 
time  setting  to  time  setting,  As  previously  stated,  a  necessary  assumption 
for  the  application  of  the  least  squares  method  is  that  the  variances  of  the 

Therefore,  it  was  necessary  to  perform  a 


errors  in  the  be  constant. 


transformation  of  the  biases  to  remove  tha  effect  of  heterogenous  variances 
at  the  various  temperatures  and  time  settings.  A  suitable  transformation 
found  in  [8]  is  to  let 


(12) 


where  Y  is  the  kth  observed  bias  and  «r  is  the  standard  deviation  of 

the  biases  at  the  i**1  set  time  and  temperature.  The  transformed 
variates,  Y^,  have  the  property  that  their  variances  equal  one.  That 


is, 


Var  (Y't)  a  E 


[YUk  ‘  E  <  V] 


=  E 


liUL  -E 
-<rij 


Y. 


,JJ!L 


)] 
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Since  the  true  variances  of  the  biases  were  not  known,  the  reciprocal  of 
the  sample  variances  were  used  as  the  relative  weighting,  factors, 

In  order  to  cover  the  range  of  feasible  values  of  (3^,  the  estimates 
of  this  parameter  used  in  the  determination  ranged  from  -10,000  to 
+ 10, 000,  This  range  was  scanned  first  at  intervals  of  100,  10,  1,  0, 1, 
and  0,  01  until  the  value  of  P3  was  found  which  gave  the  smallest  sum  of 
squares  of  residuals.  In  each  iteration  of  the  process,  least  squares 
estimates  of  0}  and  (32  corresponding  to  the  estimate  of  p3  were 
computed.  Plots  of  the  error  root  mean  squares  (in  the  subrange  indicat¬ 
ing  a  minimum  sum  of  squares  of  residuals)  obtained  in  each  iteration 
versus  the  estimated  values  of  |33  are  given  in  Figure  1,  for  the  three 
zones  of  fire.  (A  smooth  curve  has  been  drawn  through  the  points. ) 

Table  I  gives  the  least  squares  estimates  of  the  regression  parameters 
and  the  sum  of  squares  of  residuals  for  the  three  zoneB  of  fire. 

THE  HARTLEY  MODIFICATION  OF  THE  GAUSS-NEWTON  ITERA¬ 
TION!  The  Hartley  modification  of  the  Gauss -Newton  iteration  initially 
requires  the  expansion  of  the  regression  function  in  a  multiple  .first  order 
Taylor's  series  about  initial  estimates  of  the  parameters,  fy,  (3 2  and  P31 
obtaining  an  expression  of  the  form 

Y  -  f  (Xj,  X2;  +  APj,  p2  +  A(32,  p3  +  Ap3) 

(!4)  3 

*  £  (Xi ,  X2i  ?!  ,  P2»  P3)  +  '  S  ■gjT  Af3i 

ial  ri 

where  the  partial  derivatives  are  evaluated  at  p^  *  (3^  (i  «  1,  Z,  3)  and 

the  AP1  are  corrections  to  the  (3^  to  be  computed.  This  step  is  based  on 

the  assumption  that  the  regression  function  is  linear  In  the  neighborhood 
of  the  estimated  values  of  the  parameters.  For  convenience,  (14)  may  be 
written  as 

(IS)  Y  .  f»£0  t  »  f,  AS,  . 

ial 

This  expression  is  linear  in  the  unknown  corrections,  AP^,  and  therefore, 
under  the  appropriate  assumptions,  the  method  of  lsast  squares  may  be 
employed  to  estimate  the  corrections  to  ths  initial  estimates  of  the  P^. 

The  normal  equations  are  obtained  by  minimising  ths  sum  of  squares  of 
residuals  given  by 
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assuming  .at  the  £.  are  continuous  over  the  range  of  values  of  the 
independent  variables,  X ^  and  X^. 


Then,  instead  of  applying  the  entire  correction  to  the  p  as  is  done 
in  the  Gauss -Newton  iteration,  a  fraction  v  of  the  correction  is  applied, 
where  v  is  determined  as  follows. 


Consider  the  sum  of  scnares  of  residuals  to  be  a  function  of  v  by 
defining  it  as 


(17)  Q  (v)  =  Q  (Px  +  vAPj,  P2  +  vAP2>  P3  +  Ap3)  o<  v<l. 

(The  p.,  the  initial  estimates  of  the  parameters  and  the  Ap.,  the  corrections 
to  the  estimates  are  known  values,  leaving  only  v  unknown.)  The  value  of  v 
giving  a  minimum  of  Q  (v)  is  found  by  an  approximate  method  by  deter¬ 
mining  the  level  of  v  at  which  the  parabola  passing  through  Q  (0),  Q  (j), 
and  Q  (1)  has  a  minimum.  Using  the  Lagrange  method,  the  parabola 
passing  through  these  points  can  be  written  as 

(18)  §  (v)  =  [20(0)  -  4Q(i)  +  20(1)]'  v2  -  [30(0)  +  4Q($)  -  Q(l)]  v  +  Q(0)  . 


After  differentiating  ^  (v)  with  respect  to  v  and  setting  the  results 
equal  to  zero,  the  value  of  v  giving  a  minimum  of  §  (v)  is  found  to  be 


(19) 


min 


3Q(0)-  40(1)  +0(1) 

4  [6(0)  -  26(f)  +0(1)1 


Q(0)  -  Q(l)  i 

4[Q(0)  -  2Q(i)  +Q(l)]  2 


Using  this  value  of  v,  the  new  estimates  of  the  p^  to  be  used  in  computing 
the  sum  of  squares  of  residuals  and  in  the  next  iteration  is  computed  from 
the  expression 


(20) 
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.  «*  P,  +  v  Ap. 
l  i  min  ri 
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The  above  procedure  is  repeated  until  the  estimates  yielding  the  minimum 
sum  of  squares  of  residuals  is  obtained. 

As  indicated  in  [5]  ,  in  the  event  that  the  value  of  vffi^  does  not 
give  a  reduction  in  the  sum  of  squares  of  residuals  from  one  iteration  to 
the  next,  the  value  of  v  yielding  a  minimum  of  §  (v)  in  an  interval  of 
half  the  length  should  be  used  to  compute  the  new  estimates. 

Because  the  Hartley  modification  requires  initial  estimates  that  are 
in  the  neighborhood  of  those  yielding  the  absolute  minimum,  and  since 
this  procedure  was  to  be  used  as  a  check,  initial  estimates  of  the  para¬ 
meters  were  selected  in  the  neighborhood  of  the  final  estimates  obtained 
in  the  scanning  process.  Table  II  gives  the  least  squares  estimates  of 
the  parameters  of  (3)  ard  the  sum  of  squares  of  residuals  obtained  in  the 
final  iteration  of  the  Hartley  modification. 

CONSTRUCTION  OF  CONFIDENCE  INTERVALS.  Another  procedure, 
presented  in  [1]  ,  yields  least  squares  estimates  of  the  parameters  as 
well  as  an  estimate  of  the  variance -covariance  matrix  of  the  least  squares 
estimates,  which  can  be  used  to  construct  approximate  confidence 
intervals  about  the  individual  parameters.  This  procedure  is  based  on 
the  Fisher  information  matrix  as  described  in  [9]  . 


Corrections  to  initial  estimates,  p  are  derived  by  expanding  the 
normal  equations  in  Taylor's  series  about  the  initial  estimates,  obtaining 
expressions  of  the  form 
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where  AP.  =  pj  -  P.  and  Q{p)  is  the  sum  of  squares  of  residuals.  From 
this,  the  set  of  normal  equations  can  be  written  in  matrix  notation  as 


(22) 


y  Ap  =  g 


where  V  is  the  Fisher  information  matrix  with  elements 
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9(3.  ap. 


1,  2,  3)  . 


(23) 


AP  is  a  column  vector  with  components  Apj,  and  2G  ie  a  column  vector 
with  components  •  Solution  of  (22)  yields  the  corrections  which 

5pi 

are  to  be  applied  to  the  initial  estimates- to  obtain  estimates  to  be  used  in 
the  next  iteration.  That  is. 

(24)  AP  =  V"1  G 


pl  =  P  + AP  =  P  +  V"1  G 


where  P  is  the  vector  of  estimates  to  be  used  in  the  next  iteration  and 

Ml  1 

p  is  the  vector  of  initial  estimates. 

When  the  process  has  converged  to  the  least  squares  estimates,  the 

matrix  V"  <r  is  an  approximation  to  the  variance -covariance  matrix 
of  the  least  squares  estimates.  Using  this  approximation,  it  is  possible 
to  construct  confidence  intervals  about  the  individual  parameters  such 
that 

f  ^  -  ^3vU  Fa  (3.  n-3)'<  P1  <  ^  ^ 3y11  Fq  (3,  n  3)"\ 


P  {  *2  -  * 


3v  Fft  (3.  n-3)  <p2  <^2  +fry3v6fi  Fa  (3.  n-3)\>l- 


^  -  3--^3v33  Fa  (3,  n-3)  <  P3  <  P3  +  *  V3v  Fa  (3,  n-3) 


where  V  *  (i  =  1,  2,  3)  is  the  diagonal  element  of  the  ith  row  of  V  , 
F^  (3,  n-3)  ie  the  a  percentile  of  the  F  distribution  with  3  and  n-3 

degrees  of  freedom  and 

(27)  •  ■  ' 


Ninety  percent  confidence  interval  estimates  based  on  the  estimates  * 
obtained  in  the  scanning  process  and  those  obtained  by  the  Hartley  modi¬ 
fication  are  given  in  Table  III.  A  combination  of  the  Hartley  modification 
and  the  procedure  described  in  this  section  yielded  estimates  that  gave  a 
slightly  smaller  sum  of  squares  of  residuals  in  each  of  the  three  zones. 


.n-’.  t 


I 


Point  estimates  and  90%  confidence  interval  estimates  based  on  the 
combined  procedure  are  given  in  Table  IV. 


DISCUSSION  OF  RESULTS.  It  can  be  seen  by  inspection  ot  Tables 
I  and  II,  which  give  the  estimates  obtained  in  fitting  the  regression 
function  by  the  scanning  procedure  and  by  the  Hartley  modification  of  the 
Gauss-Newton  inte  ration,  that  the  results  of  the  two  procedures  do  not 
differ  to  any  great  degree.  In  general,  the  estimates  of  |3,j  obtained  by 

the  two  processes  differ  more  than  the  e stim&te s  of  the  other  two  para¬ 
meters,  especially  in  Zone  II,  However,  it  is  pointed  out  that  the  error 
root  mean  squares  in  the  neighborhood  of  the  apparent  minimum  are  less 
sensitive  to  small  (positive)  changes  in  this  estimate  than  in  those  for  the 
other  two  parameters.  This  can  be  seen  from  Figure  1.  In  addition, 
in  examination  of  the  error  root  mean  squares  in  the  various  iterations  of 
the  Hartley  modification,  it  was  noted  that  a  difference  of  as  much  as  four 
in  the  estimates  of  in  the  neighborhood  of  the  minimum  caused  a 
change  of  only  0.01  in  the  error  root  mean  squares. 


Further  examination  of  the  estimates  presented  in  TableB  I  and  II 
reveals  that  for  each  zone,  the  estimates  of  are  relatively  small  in 
comparison  to  the  estimates  of  Pi  and  P3.  This  may  lead  one  to  think 
that  this  parameter  does  not  contribute  significantly  to  the  model  and  may 
be  eliminated  from  consideration.  In  fact,  tests  of  hypotheses  based  on 
the  assumption  that  the  statistic 


(28) 


P,  -  0 
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is  distribuled  as  "Student's"  t  distribution,  indicated  that  the  hypothesis 
that  =  0  would  be  accepted  in  Zones  I  and  IIw  and  the  hypothesis  that 

=  0  would  be  accepted  In  Zone  III  at  the  .  05  level  of  significance.  On 

the  basis  of  these  results,  the  model  equations  for  the  various  zones 
could  be  as  indicated  below. 
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Each  of  the  two  model  equations  above  have  properties  that  are  similar 
to  those  of  the  original  model  in  that  the  regression  functions  determined 
from  these  models  represent  straight  lines  passing  through  the  origin 
when  is  constant  and  rectangular  hyperbolas  with  horizontal  and 
vertical  asymptotes  when  is  constant.  These  equations  would  be 

much  more  suitable  for  use  on  FADAC  than  the  original  model  equations. 

Figures  2,  3,  and  4  give  perspective  sketches  of  the  general  shapes 
of  the  surfaces  represented  by  the  estimated  regression  functions.  Sketches 
of  the  constant  temperature  and  constant  set  time  contour  lines  are  given 
in  Figures  5,  6,  and  7.  To  indicate  how  well  the  curves  fit  the  data, 

Figures  8,  9  and  10  give  sketches  of  the  constant  set  time  curves  (for 
selected  time  settings)  with  the  data  points  plotted.  Similar  sketches  for 
selected  constant  temperatures  are  given  in  Figures  11,  12,  and  13. 

It  can  be  seen  from  the  last  two  Bets  of  sketches  that  the  variability 
of  the  observed  biases  was  relatively  large  at  low  temperatures  for  the 
giventime  settings;  however,  there  was  little  difference  in  the  variability 
at  the  various  time  settings  for  a  given  temperature.  It  is  also  easy  to 
see  that  the  assumption  that  the  effect  of  temperature  on  bias  is  relatively 
constant  is  not  a  bad  assumption  for  temperatures  slightly  above  zero 
degree  Fahrenheit. 

On  the  basis  of  the  amount  of  information  obtainable  from  the  procedures 
discussed  in  the  foregone  sections,  it  appears  that  the  best  procedure  is 
a  combination  of  the  Hartley  modification  and  the  procedure  utilizing  the 
Fisher  Information  matrix.  Point  estimates  of  the  regression  parameters 
and  90%  confidence  intervals  obtained  by  this  combined  procedure  are 
given  in  Table  III.  The  combined  procedure  gave  sums  of  squares  of 
residuals  that  were  slightly  less  than  those  obtained  by  either  of  the  other 
two  procedures,  although,  due  to  slightly  larger  estimates  of  the  <r^  ,  the 

confidence  intervals  obtained  for  this  method  were  generally  longer  than 
those  for  the  other  two  procedures. 
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TABLE  III 


Ninety  Percent  Confidence  Intervals  about  Regression  Parameters 


Method  of 
Estimation 


Scanning 

Process 


Modification 


Zone 

No.  of 
Observations 
Considered 

I 

171 

II 

96 

III 

217 

I 

171 

II 

96 

III 

217 

90%  Confidence  Limits  on  - 


*2 


Lower  Upper  Lower  Upper  Lower  Upper 

Limit  Limit  Limit  Limit  Limit  Limit 


0.3746  0.7462'  -0.0013  0.0059  45.46  ,63.70 

0.4029  0.8755  -0.0073  0.0031  50.74  78.92 

-0.0476  0.0966  -0.0083  -0.0025  43.15  59.77 


0.3792  0.7377  -0.0012  0.0053  45.75  62.87 

0.3944  0.9344  -0.0078  0.0032  50.04  84.28 

-0.0349  0.0917  -0.0078  -0.0034  46.39  54.95 


Figure  I 


Figure 
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OBSERVATIONS  ON  THE  SELECTION  OF  PREDICTORS 


H.  L.  Lucas  and  A.  C.  Linnerud 
North  Carolina.  Stats  University  at  Raleigh 


1.  INTRODUCTION  AND  SUMMARY.  Mast  work  on  the  selection 
of  predictors  has  been  done  in  the  context  of  the  general  linear  model, 

(1)  I  -  t*!*2  ■  •  •  *p  iJ  [PLP2  •  •  ■  Pp  !]  '  - 

where  ^  (obiervatiOrtl  oh  the  predictahd)  ,  x,  (observation«  on  the  ith 
predictor)  and  (random  residuals)  are  alTn  x  1  matrices,  the  p^  (regres¬ 
sion  coefficients)  are  scalars,  and  the  prime  means  transpose,  lFor  one 
class  of  practical  situations  it  can  be  assumed  that  observations  have  been 
made  on  all  predictors  that  are  relevant  (corresponding  p^s  non-zero)  and 
possibly  on  some  that  are  not  (corresponding  p's  zero).  Given  a  set  of  data 
the  problem  is  to  decide  which  one  or  more  of  the  2P  subsets  of  predictors 
is  or  are  likely  to  be  the  correct  one,  In  the  present  study,  attention  has 
been  confined  mainly  to  selection  of  the  single  best  candidate . 


Three  criteria  of  selection,  namely,  the  residual  mean  square  (MS) 
the  Mallows  C-statlstic  (C)  and  a  modification  of  the  C-statistic  (MC),  have 
been  studied  to  date.  It  was  assumed  that  a  set  of  data  is  a  random  sample 
from  a  population  characterized  by  certain  values  of  the  p^  and  by  the  form 
and  the  parameters  of  the  joint  distribution  of  the  x*  and  The.  xj  were 
assumed  to  be  measured  without  error.  Performance  was  studied  in  terms 
of  the  probability  that  a  criterion  leads  to  selection  of  the  correct  subset  of 
predictors, 


Since  the  mathematics  has  appeared  to  be  intractable,  a  highspeed 
computer  has  been  used  to  study  the  problem  empirically.  So  far,  data 
have  been  obtained  only  on  cases  with  p  *  3,  the  and  ^jointly  normally 
distributed  and 
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where  J,  (n  x  1)  has  elements  all  1,  £  (n  x  1)  and  0  (n  x  n)  have  elements  all 
zero,  f"(n  x  n)  is  an  identity  matrisTand  p  (1  x  1)  is  the  correlation  between 
x2  and  Xj. 


* 


Using  sample  size  n  =  20,  100-200  samples  were  drawn  for  each  of 
several  t-oiiiuiiidLione  of  p  (0.00,  0.95)  ana  p-sets  having  dilierent  numbers 
(0,  1,  2,  3)  and  magnitudes  (i,  2,  5)  of  non -zero  elements.  Although  the 
results  exhibited  many  qualitative  and  some  quantitative  features  which 
were  not  unanticipated,  the  quantitative  features  were  in  general  pleasantly 
surprising  to  the  author.  All  three  criteria  were  better  for  selecting  the 
correct  subset  of  predictors  than  was  expected  on  the  basis  of  some 
approximate  and  apparently  naive  preliminary  considerations.  This  was 
particularly  true  for  the  cases,  all  P's  zero  or  the  non-zero  (3's  small. 

For  most  situations  studied,  C  was  superior  to  MC  and  MC  superior  to  MS. 
Exceptions  occurred  particularly  when  the  magnitudes  of  |3j  and  p^  were 

small  but  both  non-zero,  and  the  correlation  between  x^  and  was  .  95. 

Exclusive  of  these  exceptions,  criterion  C  resulted  in  57  -  100%  selection 
of  the  correct  subset  of  predictors,  and  MC  and  MS  resulted  in  30  -  100% 
correct  choice.  For  the  exceptions  noted,  however,  good  practical 
performance  of  the  criteria  was  still  obtained.  The  subset  of  predictors 
selected  as  best  simply  alternated  between  including  x.  and  x  rather  than 
both. 


2.  BACKGROUND. 

2.1.  General  orientation: 

There  is  a  point  which  needs  to  be  emphasized  before  focusing  on  the 
immediate  setting  of  the  results  to  be  presented.  It  is  this.  Given  a  vector 
of  observations  on  a  predictand  u  and  potential  predictors  _z  (j  *  1,  2,  .  .  .  ,tt ) , 

one  should  consider  any  theory  and  reasonable  supposition  regarding  the 
nature  of  relationship  of  u  to  the  z^.  It  is  often  not  sensible  to  assume  that 

u  =  of  (l)  and  that  z  =  x^  of  (l),  although  this  is  too  often  done.  Rather, 
it  may  be  proper  to  let  y  =  tj_  (u)  and  x  =  £  (as  ,  -  f^,).  Considera¬ 

tion  here  will  be  confined  to  situations  such  that  transformations  of  the 
observations  on  predictand  and  predictors  permit  expression  of  the  rela¬ 
tionship  as  in  (l). 

It  will  be  assumed  that  a  practical  situation  can  be  characterized  by 
certain  values  of  the  p.  and  by  the  form  and  the  parameters  of  the  joint 

distribution  of  the  x.  and  t.  More  /ecifically ,  it  will  be  assumed  (o  start 
that  1 
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A  given  set  of  data  then  represents  a  sample  from  the  x.,  e  population 
which,  with  the  (3.,  implies  y  .  1 

The  problem  is  to  examine  the  performance  of  the  aforementioned 
criteria  for  deciding  on  which  of  the  2?  subsets  of  the  x.  is  the  relevant 

subset  of  predictors  (i,e.  ,  which  of  the  2?  subsets  of  the  p^  consists  of 
only  and  all  the  non- zero  p,). 

2.2.  The  selection  criteria: 


Let  v  =  1,  2,  3,  ....  2P  index  the  subsets  of  the  jc.  Then  upon 
rearrangement  of  the  columns  of  [x^  x_,  .  .  .  x^  of  (l)  and  the  elements 
of  [U  .  .  .  p  1]  we  can  write 

1  u  P 


y  =  [Z  X  €]  [0  p  1] 

—  V  V  — 1  —V  —V 


where  the  columns  of  Z  (n  x  q  )  consist  of  the  vth  subset  of  the  x. ,  those 

v  v  —1 

of  (n  x  p-q^)  the  remaining  jc  ,  0_^  and  J3^  consist  of  the  correspondingly 

rearranged  p.,  and  q  =  0,  1,  2,  ....  p.  Then,  assuming  Z  ,  X  to  be  of 

full  rank,  the  total  sum  of  squares  ^  =  X'  X  can  Petitioned  into 
1  - 1  1 

S  =  (7  Z  )  Z  y  and  R  =  T  -  S  ,  and  R  carries  n  -  q  degrees  of 
V  V  V  v^“  V  V  v  ° 

freedom.  It  is  useful  to  note  that 

<5>  e<\>  ■  4aa  +  <■>-  v  • 
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where  A  =  X  '  [I  -  Z  (Z '  Z  )" 1  Z  '  ]  X  . 

v  V  V  V  V  v  V  * 

The  criteria  compared  were 

MSv  =  R/(n  -  qv) 

Cv  =  R -  n  +  2q 

MCy  =  R^s2  -  (n-p)  (n-qv-2)/(n-p-2) 

2 

where  s  is  R ^/(n-q^  for  the  case  in  which  Zy  contains  all  the  x^  . 

When  analyzing  data,  the  correct  subset  of  predictors  is  chosen  to  be 
the  one  among  the  2P  subsets  which  has  the  smallest  value  for  the  criterion 
being  considered. 

Under  the  assumption  £  =  0,  and  distributed  with  mean  0  and 
variance  I. 


E  (MSy)  =  1, 

E  (cv)  =  ( P‘% )  («-P)/(n-P-2)  +  2tlv"P 

~  q^  if  n  is  large , 

E  (MCv)  =  0. 


These  expectations  are  of  some  interest  when  studying  the  results  in  the 
next  section. 


3.  RESULTS,  As  indicated  in  the  introduction,  the  mathematics 
appeared  to  be  intractable,  so  performance  was  studied  empirically.  The 
scope  of  thiB  work  has  been  restricted  by  the  computational  capacity  avail¬ 
able  to  date,  but  programming  for  a  much  faster  computer  is  now  in  process 
and  it  will  be  possible  to  study  more  predictors  than  three.  The  current 
results,  some  of  which  are  shown  in  the  following  tables,  may  provide  some 
helpful  insight  toward  obtaining  at  least  an  approximate  analytic  solution. 
They  also  may  aid  in  constructing  a  sharper  criterion  for  selecting 
predictors. 
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choice 


led  C- statistic 


Ifrble  hm  Oamaxisoa  at  choice  by  residual  aean  aqmre  (MS),  C-statlstic  (c)  andjwglfled  C-Btatlstlc  .jjC) 


SAMPLE  CENSORING* 


N.  L.  Johnson 
Department  of  Statistics 
University  of  North  Carolina 
Chapel  Hill.  North  Carolina 

1.  INT ROD UC TION ■  There  are  currently  available  a  number  of 

methods  designed  to  reduce  the  possible  effects  of  "wild"  ("maverick") 
observations  on  the  analysis  of  sample  values.  Among  these  may  be 
mentioned  "trimming"  and  "Winsorisation".  These  methods  involve  the 
possible  or  sometimes  automatic  exclusion  of  extreme  values  among  those 
observed.  Apart  from  these  methods,  for  which  appropriate  statistical 
analyses,  taking  proper  account  of  the  omission  of  sample  values,  are 
available,  samples  may  be  incomplete  owing  to  inadequate  recording,  or, 
unfortunately,  biasted  selection  of  values  which  accord  best  with  some 
preconceived  ideas  or  desires. 

While,  under  properly  regulated  conditions,  information  on  any 
censoring  of  sample  values  should  accompany  the  records  of  the  values 
themselves,  this  is  not  always  the  case.  Indeed,  with  the  last  situation 
described  with  the  preceding  paragraph,  such  information  is  not  to  be 
expected;  but  also,  even  in  more  respectable  cases,  information  may  be 
omitted  by  negligence. 

The  problems  to  be  considered  in  this  paper  are  those  arising  when 
it  is  suspected  that  there  has  been  some  form  of  censoring  of  the  origins! 
sample.  Complete,  and  reasonably  tidy  solutions  are  obtained  only  on  the 
assumption  that  the  population  distribution  of  an  observed  character  is 
known.  However,  study  of  this  situation  does  give  some  clue  as  to  what 
can  be  done  when  knowledge  of  the  population  distribution  is  incomplete. 

Problems  of  a  similar  kind  have  been  discussed  in  an  earlier  paper  [l]  , 
They  were  of  a  rather  simple  nature  in  that  there  was  usually  a  direct 
choice  between  two  possible  sample  sizes. 

2.  FORMAL  STATEMENT  OF  PROBLEM.  It  will  be  supposed  that 

there  are  available  r  observations  of  a  character  (X)  which  may  be  regarded 

as  observed  values  of  random  variables  x  ',  xi,  .  .  .  ,  x  '.  These  are  a 

\  c  r 

sub- set  of  the  n  (>  r)  variables  x^',  x^,  .  . .  ,  corresponding  to  a  complete 

random  sample  of  (unknown)  size  n,  If  r  =  n,  then  the  'sub-set'  is  identical 
with  the  complete  sample.  We  will  be  interested  in  testing  whether  this  is, 
in  fact,  the  case.  Various  kinds  of  alternatives,  specifying  different  kinds 

‘"The  work  was  supported  in  part  by  Army  Research  Office  Contract  AROD-4, 
and  in  part  by  Air  Force  Contract  AF -AFOSR-760-65. 


of  censoring  which  might  be  applied  to  the  complete  sample,  can  be 
considered.  Certain  special  kinds  of  censoring  have  been  discussed 
in  earlier  papers  [2]  [3]  ,  and  the  results  of  these  investigations  will 
be  summarised  in  Sections  3  and  4.  Then,  in  Section  5,  we  will  consider 
problems  associated  with  general  types  of  censoring.  Certain  practical 
problems  arising  in  application  of  the  tests  described  in  Sections  3,  4, 
and  5  will  be  discussed  in  Section  6, 

Discussion  will  be  restricted  to  situations  in  which  x  ',  ' . x^1 

can  be  regarded  as  n  independent  continuous  random  variables,  with  a 
known  common  probability  density  function,  represented  by  rt x) . 

3.  SYMMETRICAL  CENSORING  OF  EXTREMES.  We  will  suppose 
that  if  censoring  occurs  it  takes  the  form  of  exclusion  of  the  s  greatest 
and  s  least  among  the  original  n  sample  values.  Then  x^1  ,  x, '  .  .  .  , 

are  the  r  central  values  among  an  original  set  of  n(=r+2s)  values. 
Denoting  this  hypothesis  by  H  the  joint  probability  density  function  of 

f  i  8 

the  r  ordered  variables  x,  <  x„  <  . .  .  <  x  (these  being  a  rearrangement 

.i  —  2  ■“  “  r 

of  x^',  x^,  . ,  ,  ,  x1  in  increasing  order  of  magnitude)  is; 


p(x.,  x,,  ....  x  [  H  )  *  [F(x.)]  S[l-F(x  )]  “  II  F(x  ) 

12  r  i  s.s  (iJ)Z  1  r  jal  j 


(X1  <x2  <  •••  <  *r) 


where 


The  hypothesis  that  there  has  been  no  censoring  and  therefore  that  the 
complete  sample  is  available  is,  in  the  notation  already  introduced,  H 
For  brevity  this  will  be  denoted  by  Hq.  °’° 

The  most  powerful  teit  of  H  against  the  alternative  H  has  a 
critical  (rejection)  region  of  the  form,  858 


I,.,)  >  Cp(x1 . xr|Ho) 


where  C  is  a  constant.  Whatever  be  the  value  of  a,  inequality  (2)  can  be 
written  in  the  form. 


(3) 


F(xl)  [1  -  F(xr)]  >  K 
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where  K  is  a  constant.  Inequality  (3)  does  not  depend  on  s,  so  the  test 
defined  by  this  inequality  is  uniformly  most  powerful  with  respect  to  H 
for  all  s  >  o;  i.e.  with  respect  to  any  symmetrical  censoring  of  the  ’ 
extremes  of  the  sample  values.  The  value  of  K  must  be  chosen  to  give  a 
required  level  of  significance ,  a  say,  when  H  is  true.  This  value 
depends  on  a  and  x,  and  may  be  denoted  by  K(a,  r).  Then 

(4)  Pr[F(Xl)  [1-F(xr)3  >  K(a,r)  |H0]  . 

Table  1  gives  a  few  values  of  K{a,r).  For 
r  >  10  the  approximations 
K.(0,  10,  r)  =  2.  65(r+l.  5)'2 
£(0.  05,  r)  *  4. l(r+2)"2 
K(0.  01,  r)  a  9.2(r+3.  5)'2 


give  useful  results,  Mathematical  analysis  connected  with  the  determina¬ 
tion  of  K(a,r)  is  contained  in  Appendix  I, 

A  discussion  of  the  evaluation  of  the  power  of  this  test  is  contained 
in  Appendix  II. 

TABLE  1 

Upper  100  a%  Significance  Limits  of  F(x^)[l  -  F(x  )] 


0,  05 

0,  01 

2 

0.  207 

0.  235 

3 

0,  150 

0,  195 

4 

0.  109 

0,  156 

5 

0, 0822 

0,  125 

6 

0.  0633 

0,  101 

7 

0. 0503 

0.0830 

3 

0. 0408 

0,  0692 

9 

0.  0338 

0.0585 
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4.  GENERAL,  CENSORING  OF  EXTREMES,  If  the  requirement  of 
symmetry  is  dropped  we  need  to  consider  hypotheses  of  form  H 

8  i  S 

o  r 

corresoondincr  to  evrlneinn  nf  th*  •  and  :  larzcct  individuals  in 

o  r  ® 

the  original  sample,  with  sQ  and  sr  not  necessarily  equal.  In  this  case 
there  is  no  longer  a  uniformly  mo6t  powerful  test  of  Hq.  There  is  a 
uniformly  most  powerful  test  of  Hq  with  respect  to  the  subclass  H0  8  g 

in  which  s  /s  (  ®  8  )  is  constant, 
o'  r  '  ' 


It  has  a  critical  region  of  form 

(5)  C^)]9  £l-F(xr)]  >  K(a,  r,  0) 


[if  =  0,  we  take  0  =  •*  and  replace  (5)  by  F(x^)  >  constant] 

To  obtain  a  significance  level  equal  to  a  ,  the  value  of  K(a,  r,  0),  given 
Hq  is  valid  (i.e.  there  is  no  censoring),  must  make  the  probability  that 
inequality  (5)  is  satisfied  equal  to  a  .  In  [3]  a  heuristic  method  proposed 
by  S.  N.  Roy  [4]  is  applied  to  suggest  a  possible  test  of  Hq  with  respect 
to  all  alternative  hypotheses  of  type  H_  „  (for  any  values  of  sn  and  s_). 

“o»  "r  ■  ~ 

This  calls  for  construction  of  the  union  of  regions  like  (5)  with  a  ~  a1,  over 
all  values  of  0.  Points  (F^),  Ffx^)  )  on  the  boundary  of  the  critical 

region  must  satisfy  the  equations. 

(6.1)  [F^)]6  [i-F(xr)3  aKfa1,  r,  0) 

(6.2)  ^  {  [F(Xl)]9  [1-F(xr)]  }=  9K(a',  r,  9)/8  6  . 


From  (6.  1)  and  (6.  2)  it  fu  lows  that 


(6.3)  log  F(xj)  =  8  long  K(a',  r,  0)/80  . 

If  K(a',  r,  0)  is  known,  T[x^)  can  be  found  from  (6.  3)  and  then  F(x^)  is 

determined  by  (6.  1).  However  explicit  evaluation  of  K(a',  r,  0)  is 
troublesome,  and  approximate  methods  were  used  in  [3]  leading  to  the 
simple  (through  approximate)  formula; 
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(?) 


F(xj)  +  [1  -  F(xr)]  >Kj  (a,  r) 


for  the  union  of  critical  regions.  Here  K.  (o,r)  represents  a  constant 
which  can  be  chosen  to  give  a  required  value,  a  say,  for  the  significance 
level.  (Note  that  a'  appears  only  in  the  construction  of  (7);  it  is  not  the 
significance  level  of  the  resultant  test, ) 

Although  an  approximate  argument,  applying  a  heuristic  principle 
has  been  used  in  reaching  (7),  the  critical  region  so  obtained  has  a  natural 
appeal;  and  seems  worthy  of  further  consideration, 

The  distribution  theory  associated  with  the  critical  region  (7)  is  very 
simple.  If  Hg  8  is  valid  then  F(x^)  4  [l  -  F(xr)]  is  distributed  as 

x2(so  +Sr  +2)/(*2(.o  +  sr  +  2)  +  x2(r-l))  where  X2(so  +  sr  4  2)  aild  x2(r-l) 

are  mutually  independent.  (Equivalently,  the  distribution  is  a  beta  distribu¬ 
tion  with  parameters  (s  +  s  4  2),  (r-1).  )  It  follows  (putting  s  =  s  *0) 
that  or  or 

(8)  Kj(a,  r)  *  upper  100  a%  point  of  beta  distribution  with  parameters 
2,  (r-1).  These  values  can  be  obtained  from  Table  16  of  [6]  .  r 

The  power  of  the  test  with  respect  to  a  specified  alternative  hypothesis 

H  is  also  easily  calculated.  In  fact 

s  ,  s 

o  r 


(9)  Pr[F(Xl)  +  (1-F(xr)  )  >K{  |  H§  ]  =  1 -IK  («o  4  sy  +2 ,  r-1 ) 

o*  r  1 

■'l-K,*'-1'  ■o+,r  +  2> 

where  1  (M,N)  =  [B(M,  N)]  ^  1  ^ ( 1  - 1 ) ^'  ^  dt  ie  the  Incomplete  beta 

p  J  0 

function  ratio. 


For  given  s^  and  s^,  as  r  tends  to  infinity  the  power  tends  to 


(10) 


Fr  U 


2(s  4 
'  o 


s  4  2)- 
r 


M,  1  -a 


2  2 
(where  v  t  denotes  the  upper  100  a  %  point  of  the  distribution  of  v 

V|  1  *0, 

with  v  degrees  of  freedom). 
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A  few  values  of  the  power  are  shown  in  Table  2.  It  appears  that  the 
asymptotic  (r  — *•  a)  values  give  a  good  indication  of  true  value  for  r  >  30. 


TABLE  2 


Power  (3  of  the  general  purpose  test  (a  =  0.  05) 


s  +  s 

o  r 

=  2 

6 

10 

14 

18 

r  *  4 

0. 167 

0.470 

0.716 

0.862 

0.938 

r  «  30 

0.281 

0.845 

0.989 

m  . 

- 

r  =  *» 

0.  303 

0.892 

0.996 

- 

- 

A  special  ca.^e  of  some  interest  arises  when  censoring  at  one  extreme 
only  is  suspected  (i.  e.  SQ  =  °  or  8r  *  o).  In  this  case  the  uniformly  most 

powerful  test  has  the  critical  region 

y  <  a^r  (if  s  =  o) 

’ t  'o' 

or 

Yl  >  l-al/r  (if  er  =  0)  . 


1 

The  power  of  the  test  with  critical  region  y  <  o  1  with  respect  to 


the  alternative  H  ,  is 
o  8 

r 


(r+s  )2  ra  '  s 

p(Ho‘s  )  afrTl)Ys  i  J  yr"  0-y)  r 


1/x 


dy 


=  IQl/r(r,  sr  +  1) 

(where  1  denotes  the  incomplete  beta  function  ratio). 

5.  GENERAL  CENSORING.  We  first  introduce  the  notation 
H  to  denote  the  hypothesis  that  s  observations  have  been 

V  V  i 

removed  between  x,  .  and  x.  for  j»l,2,  (r+1)  with  x  x 

j  - 1  J  o  r+1 
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In  this  notation  the  H  considered  in  Sections3  and  4  would  be 

8  t  S 

o  r 


H 


.  Also,  for  convenience  we  will  write 


a  >  w >  W|  *  •  •  i  w,  9 

o  r 


(ID 


Yj  *  F(x^) 


(j  =  l,  ...r) 


y  ■  0;  y 
o 


r+1 


Then  the  beat  critical  region  for  testing  the  hypothesis  of  no  censoring 
(H  )  against  the  alternative  H  is  of  form 

'  a  n  a  a'  •  8>8*,,.»8 

o  1  r 


O I  0$  •  e  s  f  Of  O 


(12) 


r  o . 

11  (yj+l  -  yj>  J  £  K(a*  r’  V  V  8r) 


j=o 


It  is  clear  that  there  is  a  uniformly  most  powerful  test  with  respect  to 

any  Bet  of  alternatives  H  for  which  the  ratios 

S  ,  8.  i  *  * ,  ,  B 

o  i  r 

s  Is.!...;  s  are  constant,  but  not  with  respect  to  any  other  sets 
o  1  r  r 

of  alternatives.  While  one  could  attempt  to  apply  Roy's  heuristic  principle, 

as  in  (5]  ,  to  construct  a  general  purpose  critical  region  for  the  whole  set 

of  alternatives  H  the  effect  of  approximations  might  well 

o  l  r 

be  much  more  important  in  the  more  general  case,  and  is  certainly  more 
difficult  to  gauge.  We  therefore  consider  more  or  less  arbitrarily  chosen 
criteria  which,  however,  do  have  some  relation  to  criteria  suggested 
from  theoretical  considerations. 

We  first  consider  a  test  with  critical  region 

r 

(13)  g  =  n  (yj+1  -  y^)  >  K2(a,  r)  =  K2  . 


It  is  quite  likely  that  this  criterion  may  be  felt  to  have  some  practical 
drawbacks.  These  will  be  discussed  in  Section  6,  but  for  the  present  we 
will  just  consider  how  to  evaluate  K2  in  (13),  at  any  rate  approximately. 

It  will  be  convenient  to  approximate  to  the  distribution  of  log  g,  rather 
than  of  g  itself.  The  moment  generating  function  of  log  g,  when 
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is  valid,  is 


S  }  0,  «iii  8 

O  1  I 


t  (r  +  =  OJ 

(14)  E  (g  |  H  ]  =  - 0-L-  x 

n  s. 

j*o  J 

*  i  S  ■■■  J  1  dyl  " '  arr  • 

(The  region  of  integration  is  0  <  y.  <  y  <  .  , .  <  y  <1.  Remember  that 

y  s  0  and  y  .,  =  1.1  * 

o  r+1 

Since  the  joint  probability  density  function  of  y. ,  . . .  ,  y  is 

1  r 


•  r(r+l+S  s  )  r  s, 

p(y,.  vr  I  h  . )  ■  - - 

°  i  r  it  M.  m  J-° 


n  r(8  +i) 
j=°  J 


it  follows  that 


(l5)  ]!•••!  ”  (yJ+l  •yJ)  ‘  dV"dyr  ’ 


n  r(.  +i) 

j=o  J 


r(r+l+Ss.) 

o  J 


and  hence  from  (14)  and  (15) 


(r+rs.):  *  r(t+.j+i) 

(16)  ECg^H  ]  =  -= - ^ — - - - 

“o’  V  •  *  *  '  "r  n  s  i  r(  (r+1)  (t+1)  +  S  s  ) 
jso  J  o  J 


Taking  logarithms  and  differentiating, 'the  following  expression  for 
the  mth  cumulant  of  log  g  is  obtained; 


In  particular  when  the  null  hypothesis  (=  H  )  is  valid 

(18)  K  m(log  g  |H0)  =  Cr+1)  -  (r+l)m  ^m‘l)(r+l). 

The  polygamma  functions  have  the  values 

(1)  a  -y  =  -  0.  5772 

and  ^^m_1^(l)  =  (-l)m(m-l)!  (m  >  2) 

where  S  =  1  +  2"m  +  3_m  +  .  .  . 
m 

Hence 

(19.1)  (c^-log  g|HQ)  *  (r+1)  (y+^(r+l)  ) 

(19.2)  «m(-log  g  J  Hq)  -  (r+1)  C(m-l)l  Sm  +  (-I)m"1(»+l)m”1  V (m”1>(r+l)] 

(in  :>  2)  . 

For  z  not  too  small,  we  have,  to  a  good  approximation 

(20.1)  Y(z)  =  log  (z  -  1/2) 

(20.2)  ¥^(*)  =  (-l)m_l  (m-l)J  (z  -  l/2)‘m  (m  >  1) 
whence 
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(21.1)  ^{-log  g  |H0)  =  (r+1)  (0.  57722  +  log  (r+ 1/2)  ) 

(21.2)  «m(-logg|H0)  =  (r+1)  (m-1)  l  [Sm  -  (m-1)'1  {(r+lj/fr+l/Z)}11*"1] 


Noting  that 


(i)  t>  :  least  possible  value  of  (-log  g)  is  (r+1)  log  (r+1),  correspond¬ 
ing  to  a  j/  (r+1)  for  j=l,  2,  ....  r 

C-3  (-log  «!"„)]  2  ,  UV1’2  7.35 

(il)  ■'■-m..  r.  .-  r  -I  in-  .  r.  S  — ■ - ....  -  - - 

t* 2  ("log  g  (  Hq)  J  3  (r+1)  (S2-l)3  ~ 

and 


*4(-log  g  |  H0)  L  4 _  IQ,  80 

«2(->»gg|H„)  ■  (r+1)  ‘  ril 

2  2  3 

(while  for  x  with  (r+1)  degrees  of  freedom,  k  =  8/(r+l) 

K  JK\~  12/(r+l)  ) 


(iii)  var(-log  g  |  Hq)  =  0.  645(r+l) 
while  var(0.  57722  x2r+1)  *  0.  666(r+l) 


it  appears  that  we  might  take,  as  an  approximation, 

(22)  -log  g  -  (r+1)  log  (r+1)  to  be  distributed  as  0.  57722  x  (x^  with 
(r+1)  degrees  of  freedom)  or,  equivalently 

(22)'  1,732  [  -log  g  -  (r+1)  log  (r+1)]  to  be  distributed  as  with 
(r+1)  degrees  of  freedom.  This  implies 


K2  ? 


eXpl>*2r+l,aA732] 


(r+1) 


r+1 


where 

2  2 
X  r+!,  a  is  the  lower  a%  point  of  the  distribution  of  x  with  (r+1)  degrees 

of  freedom. 
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(If  -log  g  -  (r  +  1)  log  (r+1)  is  approximated  by  0.  5587  \  i.0332(r+l)'  then 
means  and  variances  agree  while  the  values  of  k\/k?  and  *,/«?  for  the 
- _ distribution  are  7.74{x+l)“'  and  ii.oi(r+i)  *.) 


The  approximations  cannot  be  expected  to  be  good  unless  r  is  fairly 
large.  In  the  extreme  case  r  »  1  with  g  =  y^l-y^)  we  have  exactly 


(23.1)  Fr[g>G(H0]  =(1-4G)1/2  (0  <  G  <  l/4) 

while  (22)  gives 

(23.2)  Pr(g  >  G|H0]  =  1-(4G)°*866  . 


The  approximation  (23.2)  is  substantially  less  than  the  true  value  (23.  1) 
though  it  does  have  the  correct  limits  (l  and  0)  as  G  tends  to  0  or  1/4. 

In  order  to  assess  the  power  of  this  test  we  return  to  equation  (17).  Thi3 
gives  the  cumulants  of  log  g  when  a  general  alternative  hypothesis 

s  ,  s,  .....  s  is  valid.  It  would  seem  reasonable  to  fit  the  distribution 
oi  r  2 

of  [  -log  g  -  (r+1)  log  (r+1)]  by  that  of  a  multiple  of  x  >  so  that  first  and 
second  moments  agree.  It  may  be  that  better  approximations  to  upper 
percentage  points  of  -log  g  would  be  obtained  by  fitting  the  first  three 
moments  (instead  of  the  initial  point  and  first  two  moments  -  see  [4]  ). 
This  method  might  therefore  be  employed  when  the  power  is,  say,  above 
0.75. 


6.  MODIFIED  TESTS.  The  test  criteria  described  above  are  all 
based  on  the  probability  integral  transformation 


(24) 


y  = 


j*  fWdx  • 


They  explicitly  assume  that  f(x)  is  known  exactly  (in  practice  to  a  close 
approximation)  and  that  there  are  no  errors  in  observation  of  x.  This 
last  condition  is  never  satisfied  when  x  is  a  continuous  variable.  There 
is  always  some  kind  of  grouping  error  occasioned  by  the  finiteness  of  the 
number  of  digits  used  in  recording  the  observations.  This  is  particularly 
important  in  relation  to  test  functions  like  g  of  (13)  in  Section  5.  If  it  so 
happens  that  any  two  of  the  y's  are  equal  the  value  of  g  is  zero  and  the 
null  hypothesis  will  be  accepted.  Clearly,  if  this  happens  because  of  the 
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use  of  too  coarse  a  grouping  interval,  the  test  is  likely  to  be  very 
insensitive.  Furthermore,  the  larger  r  is,  the  more  likely  it  is  that 
at  least  two  x's  (and  so  two  y's)  will  be  equal,  thus  giving  rise  to  a  zero 
value  for  g.  Wr  are  thus  led  to  consider  modified  tests,  less  sensitive 
to  this  kind  of  effect.  A  simple  way  of  effecting  thiB  is  to  use  only  a 
selected  number  of  the  transformed  order  statistics  y. ,  y  ,  ....  y 

-say  y  y  y  (with  the  values  1  <  a.  <  a*  <  . .  .  <  a,  <  r 

t  *  •  •  *  9  l  c  K— * 

fixed  before  analyzing  the  data,  of  course)  and  to  apply  a  test  with  critical 
region 

...  ...  .  k+1 

(25)  ■  ga  -  n  (y  .  y  )  >  K3  , 

j*o  *  J-1 

with  y  a  1,  y  =  0.  (A  natural  choice  would  be  to  take  the  a's  at 
ak+l  ao  equal  intervals  apart.) 

The  value  of  depends  on  the  required  significance  level,  a  ,  and  also 

on  the  selected  a. 1  j ■ ,  as  well  as  on  r.  In  fact  the  distribution  of  g&,  when 

H  is  valid,  is  the  same  as  that  of  g,  with  r  replaced  by  k,  when 

H  is  valid  and  with  s.  *  a..,  -a,  -  1  (j«0,  1,  2,  k) 

B0‘  Bl . *k  J  J+l  J 

hence,  the  same  calculations  as  those  needed  to  evaluate  the  power  of  the 
test  using  g  are  required  in  calculating  the  value  K3  in  (25).  Also,  of 
course,  calculation  of  the  power  of  the  test  with  critical  region  (25)  will 
follow  the  same  lines. 

A  similar  kind  of  modification  can  be  applied  to  tests  of  symmetrical 
censoring  of  extremes  (Section  3).  In  this  case  it  would  be  natural  to 
ignore  the  least  and  greatest  m  observations,  and  use  only  ym+^  Yr  m> 

The  uniformly  most  powerful  test  of  Hq  against  symmetrical  alternatives 

s  has  a  critical  region  form  similar  to  (3),  vi2: 


y  ,1  (1-  y  )  >  K. 
7m+l  '  r-m  -  4 


Determination  of  is,  however,  more  troublesome  than  for  K.  The 
equation 


Evaluation  of  the  integral  of  the  left  hand  side,  with  K.  replaced  by  K, 

;  __  . ■ . ‘ .  . - 

gives  the  power  of  the  test  with  critical  region  (3)  with  respect  to  the 
alternative  hypothesis  H  .  The  notes  in  Appendix  II  are  therefore 
relevant  to  this  problem.  ’ 

7.  CONDITIONS  OF  APPLICABILITY.  It  may  be  felt  that  the 
condition  stated  at  the  beginning  of  Section  6,  namely  that  the  true 
probability  density  function  f(x)  must  be  known,  is  unlikely  to  be  satisfied 
in  practice.  While  this  is  so,  in  the  strict  sense  that  it  is  very  rarely 
the  case  that  a  theoretically  formulated  model  gives  an  exact  representa¬ 
tion  of  reality,  it  will  sometimes  be  the  case  that  there  is  sufficiently 
massive  evidence  to  establish  f(x),  from  observed  relative  frequencies, 
with  adequate  accuracy.  Slight  variations  in  form  of  f(x)  can  be  tolerated 
without  serious  effect,  particularly  if  a  modified  test  of  the  type  described 
in  Section  6  is  used.  It  may  be  noted  that  it  is  not  essential  that  f(x)  have 
a  simple,  or  indeed  any  explicit,  mathematical  form  -  a  graphical 
representation  can  suffice. 

It  would,  however,  be  interesting,  but  beyond  the  scope  of  the  present 
investigation,  to  inquire  into  the  robustness  of  these  tests  with  respect  to 
variation  in  f(x).  (i.  e.  to  use  of  an  incorrect  function,  f  (x)  say,  in  (24)  ). 
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Appendix  I 

We  have  to  consider  the  evaluation  of  K(a,  r)  from  equation  (4). 

Putins  v.  -  Ffx.Wa#  in  (llH.  the  loint  orobabilitv  density  of  v.  and  v  , 
~  'J  ‘  J  '  ’  '  •  *  *  ‘  r 

given  Hq,  is 

(A.  1)  p(y1#  yf  |HQ)  =  r(r-l)  (y^-yp'-2  (0  <  <  Yr  <  0  • 

Hence  K(a,r)  (now  written  as  K  for  convenience)  satisfies  the  equation 
(A.2)  r(r-i)  J  j"  (yr  -  yj)1*"2  dy1dyj 


■  a 


y^i-y)  >  k  . 

The  region  y^(l«y  )  >  K  can  be  defined  by  the  inequalities 

Yl  <  yr  <  1-K/y^  and  these  imply  also  1-y^  -  K/y^  >0or  Y-<y^<Y+ 


where  Y  «  [l  +  V  1-4K]  /2. 

Hence  from  (A.  2) 

rY+  ,  , 

(A.  3)  r  \  (1-Ky“  -y)r“l  dy  o  a  . 

Jy. 

Expanding  the  integrand  and  integrating  term  by  term  leads  to  the  equation 

(A.4)  r  X1  (r“l)  (-l)jKj  ^  1  (T\j~l)  (-1)1  h  (SU4K) 
j=o  J  i«o  J 

1+e , 


where  hQ(z)  a  log  (j— — ) 


hm(*)  ■  2"m  m'1  C(l+z)m-(l-z)m]  . 


Note  that  for  m  >  0, 
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b; 


(A.  5)  h  (VI-4K) 

m 

S  [m'1  K?]  VT^Sc 

°<J  J 

•  *.*%  {V  1 -4K)  . 

-m 


For  t  «  2(1)  9.  the  left  hand  side  of  (A.  4)  it  shown  in  Table  A.  1 
below. 


TABLE  A.  1 


-  log 


1  +  v  1  -4K 
1  -  V  1-4K 


1  +8K 


1+26K  1 2K  (1  +  K) 

1  +  ~  K  +  K2  20K(1  +  3K) 

1+97K  +  226K2  30K(l  +  6  K  +  2K2) 

1  +  ~  K  +  K2  +  K3  42K  (1  +  10K  +  10K2) 

9  3  9 


1+i!^K+®^K2+2^K3 

5  5  5 

,  .  10618  „  .  125634  Jl 
1  +-_  K  +  — 5J-  K 

.  218044  „3  .  32768  „4 
+  __  K  +  -55-  K 


56K  (1  +  15K  +  30K2  +  5K3) 
72K(l+21K+7oK2  +  35K3) 


_ - -  1  A  J  1  _iV 

(For  example  for  r  =  3,  1+8K)  vHTIk  -  6K  log  ■  =  a  .) 


1  -  V  1-iK 

The  calculations  rapidly  become  more  complicated  as  r  increase e. 
It  therefore  is  desirable  to  search  for  some  approximation  to  K(o.r) 
which  will  give  useful  results  for  r  large  (and  preferably  for  r  >  10). 


Some  empirical  formulae  have  been  given  in  Section  3.  Here  we  uae  an 
analytical  approach,  starting  from  equation  (A.  3).  We  first  make  a 
succession  of  transformations,  aimed  at  obtaining  an  integrand  far  whfah 
useful  bounds  can  be  set. 


Firstly,  putting  y  »  *  >/K 


where  A(K)  »  (l/2)  (lW  I-TkJ/Vk;  . 


Next  making  the  transformation  x  »  e  the  integral  becomes 
plog  A(K) 

/k  \  e  {1-Vk  (e  +e  )>  dt 

J-logA(K) 


/■•log  A(K)  *  .  .  , 

-Vk  f  {1-/K  (.'+.•*))'■'  dt 

J-log  A(K) 

(A.  6)  «2/K  jlog  (i-tfK  cosh  t)*"1  cosh  t  dt. 


Now  making  the  transformation  v  3  2'/~K  cosh  t,  we  obtain 


(A.  7) 


(v2  -  4K)“l/2v  dv  . 


Integrating  by  parts,  this  is  equal  to 

(A.8)  (r-l)  f1  (1-vrV  -  -*K)l/2  dv. 

J2VK 

Thus  equation  (A.  3)  can  be  written 

r(r-l)  C  (v2  -  4K)^2  (l-v)r”2  dv«  a  . 

J*/K 
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Making  the  final  transformation  v  =  2Vk  +  (1  -2Vk}u  we  obtain 
(A. 9)  r{r-l)(l-2'/K)r’1/2  £  {(1-2Vk)u2  +  2/K  u}1/2 


1  (l-u)r-2  du  »  a  . 


Since 


Vu  <  {(i -2/k)u2  +  2/k  u)  ^  <  /uITk  u  +  Storks* 


it  follow e  that 


{A.  10)  o-i/xr'/viTt  iffcffi)  <a 

As  can  be  deduced  by  direct  analysis,  K-*o  as  r  -*>« ,  but  since 

(l-2/K)r  <  o 

it  follows  that  K  >  1/4  (l-o^r)2  and  hence  Kr2  cannot  tend  to  sero. 

»2  «  . 

If  we  put  K  «  Cr  (where  C  is,  of  course  a  function  of  r  and  a)  then, 

approximately 
(A.  11) 


This  implies  that  C  lies  between  fixed  limits,  and  suggests  that,  for  large 
r,  K  is  of  the  form  C  r"^.  (The  form  of  function  -  C^fr+D^) -  used  as  an 

approximation  to  K  in  Section  3  was  suggested  by  this  analysis. ) 

An  alternative,  heuristic  approach  is  as  follows; 

If  H0  be  valid,  y^(l-y^)  ie  distributed  as  uv/(u  +  v  +  w)  where  u,  v  and 

w  are  independent  x2  random  variables  with  2,  2,  2(r-l)  degrees  of  freedom 
respectively. 
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If  r  is  Urge 


Pr 


r 


uv 

(u+v+w)2 


1 

J 


(since  w/[2(r-l)]  ~  1  as  r  -*•  •).  Hence  we  have  K  ~  Cr"2 
satisfies  the  equation 


exp  (-  £  u  -  2C/u)  du  =  a 


where  C 
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APPENDIX  II 


The  joint  probability  density  function  of  y.  and  y  ,  when  H 
4*  l  r 


(r+s  +  s  )i  s  h 

(A.  12)  p(yi,  yr)  »  ^-TK^y/Vi  yl 

o'  '  t 


(°  <  V\  <  Yt  <l)  • 


Hence 


'**trl(»-y,)>K|n,  ,,  1 

o  r 

(A*13)  ^r+g  g  j,  y+  1-K/y,  a 

'  o  r7  f  s  f  /71#,  v  r,  »r-2  , 

"s  !(r-2)is  -  Jv  yl  °  J  <Vy l5  dyrdyl 

®  11  *_ 

(where  Y+  ■  (l/2)  [l  +  VI  -  4K]  as  in  (A.  3). 

s  s 

Noting  that  (l-yr)  *  *  {  (l-yj)  -  (y^y^}  **  we  see  that  the  integral  in 
(A.  1 3)  is  equal  to 

yjV°  *  (jr)t‘l>JtI-vi,,’r  ,(r+J-‘)'1(1-K/y,-y1)I'J^i 


*  £  V°  I  (’r)(-l)J(l-y1)‘r'i(«j-l)'1  I  (ri'2H-l)‘Kly1-1(l-y1)r-2-ldy1 


*r  r’2  ,.,/■!  .  ?  i  <  rY+  -  i  *  +r-2-i-j 

(A.  14)  «  z  z  \  y,  (1-y,)  r  dyr 

j»o  i»o  VJ  ’  JY_  1 

Using  (A.  5)  this  can  be  expressed  explicitly  in  terms  of  K.  The  resulting 
formula  is  rather  cumbersome,  and  does  not  give  much  insight  into  the 
dependence  of  power  on  »q  and  sx>  The  following  alternative  approach, 

although  it  depends  on  some  quite  rough  approximations,  should  give  a 

reasonably  accurate  idea  of  the  nature  of  this  dependence,  when  r  is  large 

compared  with  s  and  s  . 

o  r 
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From  (17)  it  follows  that 


/  A  1  ft\  &  (  —1  r\rtS  ir  t%  -«*  \\\ 

— /  *  XXl%  — ew ?  y*  J  1 

■  (-i)mty(ra‘i>(.<!+i) + ♦  i) 

.3my(m-l)(io  +  j^r+J)J 


Using  the  approximate  formula  (20)  we  obtain 

•  *o  -1  Br  -1 

(A.  16. 1)  »c1(-log{y1(l-yr)}  )  *  2y  -  2  j  -  23  j  -log(r-l/2) 

1*1  j=l 


+  3  log(so+sf+r+3/2) 


and,  for  m  >  2 


(A.  16.2)  K  (-log{y  (1-y  )}  )  *  (m-1).'  [  23  j'm  +  23  j 
m  i  r  •  .  _ 


j**o+1 


.2 a 

j  r+1 


+  {(m-l)(r-l/2)m‘1}-1 


-  3m{(m-l)(so  +  .r+r+3/2)m‘i}- 


If  r  is  large,  then  for  the  smaller  values  of  m{  >  2) 


(A.16.3)  a  (-log  {y,(l-y  )}  )  =  (m-l)3  (23  ,-m  +  “  -m]  . 

■jb» J 


j*s  +1 
r 


Note  that  r  does  not  appear  in  this  approximation. 
In  particular,  taking  m  =  2 


(A.  17)  var(*log  (y^(l-yr)}  )  f  I  j"2  +  E  j 


j  aS  +1  ja*  +1 

J  o  *  V 


423 


The  variance  decreases  as  s  and  /or  s  increased.  The  expected  value 
(Kj)  also  decreases.  °  r 

A  further  approximation  to  (A.  16.  3)  gives 

(A.  18.1)  Km(-log  {yi(l-yr)»  i  (m-2)l  [(so  +  l/2)’tm*1,+(sr+l/2)^m"1)] 
andin  particular 


(A.  18.2)  «2(-log  ivfi'Yj})  ■  (»Q  +  i/2)"1  +  (s^+l/2)-1  . 

If  80  s  Br  =  s>  formula  (A.  18.  1)  becomes 

(A.  19.1)  «m(-log{yl(l-yr)})  =  2(m-2)J  (s+1/2)“^R1"1) 
while  (A.  16. 1)  becomes 


(A.  19.2) 


'«1(-fog{y1(i-yr)}) 


2y-2 


•  -1 

E  j”  -log(r-l/2)  +  3  log(2s4r+3/2). 

jrl 


If  r  is  large  this  last  equation  may  be  replaced  by 

•  -1 

(A.  19.3)  K^-logfy^l-y^}  =  2y -2  Z  j  +  2  log(r+3s+5/2)  . 

j“l 

If  s  increases  to  s+1,  k  ,  decreases  by  approximately  2(s+l)"1  -btr+Ss+S^)'1 
It  is  not  suggested  that  it  will  always  be  appropriate  to  use  these  approxi¬ 
mations,  particularly  those  appearing  later,  which  depend  heavily  on  r  being 
large  compared  with  sq  and  sy.  The  approximations  are  exhibited  because 

they  bring  out  rather  clearly  the  way  the  distribution  of  -log{y.(l-y  )} 
depends  on  sq  and  s^.  1  r 
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